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DNA encoding ovine adenoyinis (OAV 287) and its use as a viral vector 

Technica l Field 

The present invention relates to a new full length 
genomic clone derived from a benign adenovirus (OAV287) 
5 isolated from sheep in Australia. The present invention 
also relates to new viral vectors derived from the benign 
ovine adenovirus and also relates to the use of these 
vectors for the delivery and expression of nucleic acid 
sequences encoding functional RNA molecules or 
10 polypeptides to animals, 

Background pf the Invention 

Diseases caused by infectious agents and parasite 
infestations cause health problems and production losses 
in domestic animals but for many infectious agents no 
15 vaccine exists. Consequently, there are major research 
efforts worldwide to develop new vaccines which can 
protect against disease. 

While some protective antigens from infectious 
agents and parasites have been identified, their 
20 successful use as vaccines requires the development of 

systems which can effectively deliver the antigen to the 
host. A variety of recombinant gene expression vectors 
derived principally from the pox virus family have been 
employed as these are generally of low pathogenicity. 
25 Expression of the foreign protein following infection by 
the recombinant viral vector may stimulate a protective 
immune response in the host. 

However, no viral vector has all the attributes 
desirable for all situations. Some vectors are better 
30 suited to particular tasks than others because of their 
biological properties. For example, it has often proved 
difficult to stimulate an effective mucosal immune 
response which can protect against disease. In humans, 
adenoviruses have been given orally to vaccinate against 
35 respiratory disease (1). As this involves protection at 
mucosal surfaces adenoviruses clearly have potential in 
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this regard. Human adenovirus vectors have also been used 
to deliver genes to muscle (2) and other tissues. 
Although adenoviruses do not generally integrate their DNA 
into the cellular genome, nevertheless, the DNA persists 
5 and long term protein expression is observed. Expression 
of an appropriate antigen from such cells can generate a 
systemic immune response which may be protective against 
the homologous disease causing agent. 

Known adenovirus genomes are linear double-stranded 
10 DNA molecules which have an inverted terminal repeat 

sequence ( ITR) at each end and a protein covalently bound 
to the 5 '-terminal C residue (3). The genome sequence and 
structure has now been completely determined for human 
adenoviruses types 2, 5, 12 and 40 and partially for 
15 numerous others, including some animal isolates (see 
Genebank and EMBL Nucleic Acid databases). Human 
adenovirus type 2 was the first genome to be sequenced but 
broadly speaking its genome arrangement is conserved among 
other characterized adenoviruses i.e. early regions E1-E4 
20 and the structural protein homologues can be recognized in 
similar locations in the genome. In particular, the 
E1A/E1B region is located at the left hand end of the 
genome and region E4 is always located at the right hand 
end of the genome. Early region E3 is always located 
25 between the genes for structural proteins pVIII and fiber, 
although its size and complexity varies between species 
e.g. from 3kb with at least 10 open reading frames in 
human adenoviruses to approximately 0.7kb with only two 
significant open reading frames in murine adenovirus (4, 
30 5). E3 is a key region for the construction of 

recombinant viruses as it is non-essential for replication 
in vitro (6). The late, L region is expressed from the 
major late promoter, MLP and complex splicing generates 
families of mRNAs which code for most of the structural 
35 viral proteins. Proteins IVa2 and IX appear to have their 
own promoters . 
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Although there are some human viral vectors 
available for medical use there are few animal viral 
vectors suitable for use in veterinary applications. In 
order to obtain a more suitable animal viral vector the 
present inventors have purified an ovine adenovirus 
(OAV2 87) isolated from sheep in Western Australia. This 
ovine adenovirus is serologically related to bovine 
adenovirus type 7 but is genetically distinct from the 
bovine adenoviruses and other Australian ovine isolates, 
as shown by comparisons between the ovine and bovine 
adenoviruses, based on restriction enzyme profiles (8). 
The genome arrangement of the virus according to the 
present invention varies significantly from all other 
known adenoviruses. The adenoviral DNA molecule of the 
present invention is suitable for use in viral vectors 
capable of expressing a variety of polypeptides when used 
for veterinary applications. 
Summary of the Invention 

According to a first aspect, the present invention 
consists in an isolated DNA molecule comprising a nucleic 
acid sequence encoding the genome of ovine adenovirus 
(OAV287) substantially as shown in Figure 1 or a 
functionally equivalent nucleic acid sequence. 
Preferably, the nucleic acid sequence encoding the genome 
2 5 of the adenovirus is substantially as shown in Figure 1. 

In a further preferred embodiment of the first 
aspect of the present invention, the DNA molecule 
comprises a nucleic acid sequence encoding the genome of 
ovine adenovirus (OAV287) wherein a portion of the 
30 adenoviral genome not essential for the maintenance or 
viability of the native adenovirus deleted or altered. 

In a second aspect, the present invention consists 
in a DNA molecule including at least a fifteen nucleic 
acid base sequence being substantially unique to the ovine 
35 adenovirus (OAV287) nucleic acid sequence shown in Figure 
1. In a preferred embodiment of the second aspect of the 
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present invention, the at least fifteen nucleic acid base 
sequence encodes a functional element of ovine adenovirus 
(OAV287). Preferably, the functional element is selected 
from the group consisting of promoter, gene, inverted 
5 terminal repeat, viral packaging signal and RNA processing 
signal. The inverted terminal repeat of ovine adenovirus 
(OAV287) comprises the first 46 nucleic acid bases from 
the 5' end of each strand of the double stranded DNA 

genome of the virus. 
10 " ma third aspect, the present invention consists in 

a plasmid including the DNA molecule of the first or 
second aspects of the present invention. Preferably, the 
plasmid includes the DNA molecule of the first aspect of 
the present invention wherein the nucleic acid sequence 
15 encoding the adenoviral genome is linked to a nucleic acid 
sequence encoding an origin of replication and a further 
nucleic acid encoding a marker. Preferably, the nucleic 
acid sequence encoding the marker encodes for resistance 
to an antimicrobial agent. More preferably the 
20 antimicrobial agent is ampicillin. 

In a further preferred embodiment of the third 
aspect of the present invention, sequences encoding 
inverted terminal repeats of the adenovirus are joined. 

In a fourth aspect, the present invention consists 
25 in a viral vector comprising the DNA molecule of the first 
aspect of the present invention and at least one nucleic 
acid sequence encoding a non-adenoviral polypeptide or 
polypeptides . 

Preferably, nucleic acid sequence encoding the non- 
30 adenoviral polypeptide or polypeptides is derived from 
bacteria, viruses, parasites or eukaryotes. More 
preferably, the non-adenoviral polypeptide is rotavirus 
VP7sc antigen, the parasite polypeptide is 
Trichostrongylus colubriformis 17kD antigen, the 
35 Taenia ovis 45W antigen or the PM95 antigen from Lucilia 
cuprina . 
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in another form, the present invention consists in a 
viral vector comprising the DNA molecule of the first 
aspect of the present invention and at least one nucleic 
acid sequence encoding a functional RNA molecule. It will 
be appreciated by one skilled in the art that a functional 
RNA molecule can include a messenger RNA molecule, an 
antisense RNA molecule or a ribozyme. 

in a fifth aspect, the present invention consists in 
a method of delivering a DNA molecule having a nucleic 
acid sequence encoding a non-adenoviral polypeptide or 
polypeptides to a target cell comprising infecting the 
target cell with a viral vector according to the fourth 
aspect of the present invention such that the DNA molecule 
encoding the polypeptide or polypeptides is expressed and 
15 the polypeptide or polypeptides is produced by the target 
cell. 

in a sixth aspect, the present invention consists in 
a method for delivering a DNA molecule having a nucleic 
acid sequence encoding a non-adenoviral polypeptide or 
20 polypeptides to an animal comprising administering to the 
animal a viral vector according to the fourth aspect of 
the present invention such that the viral vector infects 
at least one cell of the animal and the infected cell 
expresses the DNA molecule encoding the polypeptide or 
25 polypeptides and produces the polypeptide or polypeptides. 
Preferably the animal is a grazing animal and more 
preferably the grazing animal is a sheep. 

in another form, the present invention consists in a 
method for delivering a DNA molecule having a nucleic acid 
30 sequence encoding a functional RNA molecule to an animal 
comprising administering to the animal a viral vector of 
the fourth aspect of the present invention having a 
nucleic acid sequence encoding a functional RNA molecule 
such that the viral vector infects at least one cell of 
35 the animal and the infected cell expresses the DNA 
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molecule encoding the functional RNA molecule and produces 
the functional RNA molecule. 

As used herein the term "functionally equivalent 
nucleic acid sequence" is intended to cover minor 
variations in the ovine adenovirus (OAV287) DNA molecule 
which, due to degeneracy in the DNA code, does not result 
in the molecule encoding different viral polypeptides. 
Further, this term is intended to cover alterations in the 
DNA code which lead to changes in the encoded 
polypeptides, but in which such changes do not 
substantially affect the biological activities of these 
viral polypeptides . 

As used herein the term "functional element" is 
intended to cover nucleic acid sequences that encode 
15 promoters, genes, inverted terminal repeats, viral 

packaging signals and RNA processing signals. It will be 
appreciated by one skilled in the art that unique 
sequences from ovine adenovirus (OAV287) that encode these 
functional elements may be useful in other systems 
20 including plasmids and non-ovine adenoviral vectors. 

In order that the nature of the present invention 
may be more clearly understood preferred forms thereof 
will be described with reference to the following examples 
and the accompanying drawings . 
25 Brief D ? srriDtin n of the Drawings 

Figure 1 is the nucleic acid sequence of the OAV287 
genome beginning at base 1 of the left-hand ITR. 

Figure 2 shows the arrangement of OAV287 genes based 
on homologies detected with Ad2 . Regions with question 
30 marks are tentative identifications because of the lack of 

obvious homology. 

Figure 3 indicates the major open reading frames in 
the proposed El region of OAV287. Asterisks show the 
location of possible initiation codons . A previously 
35 unidentified gene ( P 28kD) which codes for a processed 

structural protein is encoded on the complementary strand. 
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Figure 4 shows open reading frames in the region of 
the OAV287 expected to contain E3. However, E3 is missing 
as the gap between the pVIII and fiber genes is only 197 
nucleotides. The site at which the Apal/Notl polylinker 
was later inserted is indicated. 

Figure 5 shows the major open reading frames in the 
probable E3 region of OAV287. Asterisks show the location 
of potential initiation codons . The Sail site which was 
modified by end-filling and re-ligation and the 
alternative site at which a polylinker sequence was later 
inserted into the genome without loss of infectivity is 
indicated. 

Figure 6 is a scheme describing the construction of 
a plasmid (pOAV287Cm) containing a full-length clone of 

15 the OAV287 genome with pACYC184 sequences inserted in the 
Sail site. Filled in regions show OAV287 sequences. 
Cross-hatched sequences are derived from plasmids pUC13 or 
Bluescribe M13+ (Amp R ) , stippled regions from pSELECT 
(TetR) and open regions from P ACYC184 ( Cm* ) . Only the key 

20 restriction sites used for plasmid construction are 
indicated . 

Figure 7 shows a map of the plasmids pOAVlOO, 
pOAV200, pOAV600 and pOAV600S. Arrowheads indicate the 
ITRs and the approximate location of the major late 
25 promoter (MLP) . The mutated Sail site and sites at which 
the Apal/Notl polylinker sequences were inserted are 
indicated. Light hatching signifies modified Bluescribe 
sequences inserted in the Kpnl site. Linear, infectious 
genomes (dark hatching) are released by digestion with 
30 Kpnl. 

Figure 8 shows the results of screening ovine 
adenoviruses OAV100 and OAV200 rescued by trans fection of 
recombinant plasmids pOAVlOO and P OAV200 into CSL503 
cells. Portions of the genome spanning (A) the mutated 
35 SphI site in OAV100 and (B) the Apal /EcoRV/NotI polylinker 
insertion site in OAV200 were amplified by PCR together 
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with the corresponding regions from wild-type OAV2 87. The 
products were digested with SphI (A, lanes 3 6, 5) and 
Apal, EcoRV or NotI (B, lanes 3-5, and 8-10, 
respectively). (U) indicates undigested samples. 
5 Figure 9 is a map of a plasmid pMT used for the 

assembly of gene expression cassettes. Fragments 
containing the OAV287 major late promoter and tripartite 
leader sequences are linked and precede a multiple cloning 
site for the insertion of genes of interest. A tandem 
10 polyadenylation signal (AATAAA) follows. 

Figure 10 shows a summary of recombinant viruses 
which have been rescued from the corresponding infectious 
plasmids and the gene expression cassettes they carry. 
Cassettes were inserted into the OAV genome between the 
15 pVIII and fibre genes as indicated. 

Figure 11 shows the expression of (A) the T. ovls 
45W and L. cuprina PM95 antigens in CSL503 cells following 
infection of these cells with OAV205 and OAV210 viruses, 
respectively and (B) VP7sc expression in CSL503 and bovine 
20 nasal turbinate cells following infection with virus 

OAV204. (I) Infected cells (U) Uninfected cells. (M) 
indicates marker proteins of the sizes shown. 

Figure 12 shows expression of VP7sc in (A) CSL503 
cells and (B) rabbit kidney and bovine nasal turbinate 
25 cells following infection with OAV206 virus. (I) Infected 
cells. (U) uninfected cells. (M) indicates marker proteins 
of the sizes shown. 
n PS rripti n n of the Invention 

METHODS 

30 Growth and Purification of OAV287 

The virus, isolated from sheep in 1985, was obtained 
from R.L. Peet, Animal Health Laboratory, Department of 
Agriculture, Western Australia. The virus isolate was 
grown in sheep foetal lung cells (line CSL503) and twice 

35 plaque-purified under solid overlay before stocks were 
prepared. Virus was purified from CSL503 cells as 
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described previously (18, 22). DNA was extracted from the 
virus by digestion with proteinase K (23). 
Cloning of Genome Fragments 

Molecular techniques for manipulation, modification 
5 and transformation of plasmid DNA which were used in the 
work described below are described in (9) and similar 
publications. OAV287 DNA was digested with various 
restriction endonucleases including BaraHI, SphI , Smal and 
Sail to deduce the location of these sites (18). 
10 The adenovirus genome has a protein covalently 

linked to each end of the linear dsDNA (24). The BamHI A 
and D fragments of approximately 8kb and 4kb, 
respectively, were identified as the terminal genomic 
fragments because their migration into agarose gels was 
15 dependent on the pre-digestion of viral DNA with 

proteinase K. The internal BamHI fragments B, C, E and F, 
estimated at 6.2, 5.1, 3.4 and l.lkb in size respectively, 
were separated on an agarose gel, recovered and cloned 
into BamHI-digested pUC13 using standard ligation and 
20 transformation procedures (9). To clone the terminal 
BamHI A and D fragments, viral DNA (lOug) was digested 
with proteinase K (50ug/ml in lOmM Tris/HCL, pH8.0, 
containing ImM EDTA and 0.5% SDS) at 65°C for 60min to 
remove the terminal protein. The DNA was extracted twice 
2 5 with phenol/chloroform, once with ether and recovered by 
ethanol precipitation. The 3 'ends (of unknown sequence) 
were then digested exo-nucleolytically with T*, DNA 
polymerase (5 units, Toyobo, Tokyo, Japan) in the presence 
of dATP (lOOuM) in buffer containing Tris HCL (50mM), 
30 pH8.0, MgCl2 (7mM), 2-mercaptoethanol (7mM) and BSA 

(lOug/ml) for 15min at 37°C. The DNA was again purified 
by phenol extraction and ethanol precipitation described 
above. To remove the single-stranded terminal regions and 
create blunt ends the DNA was digested with 1 unit of mung 
35 bean nuclease (Pharmacia, North Ryde, Australia) for 10 

min at 37°C in buffer containing Na acetate (30mM), pH4.6, 
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NaCl (50mM) and ZnCl 2 ( InvM) before extraction with 
phenol/chloroform and recovery by ethanol precipitation. 
Finally the DNA was digested with BamHI (Pharmacia) and 
the fragments were separated by electrophoresis in low- 
melting-point agarose. The BamHI A and D fragments were 
excised, recovered by NACS column chromatography (BRL, 
Gaithersburg, Md) and ligated with BamHI/HincII-cut 
plasmid Bluescribe M13+ (Stratagene, La Jolla, Ca) prior 
to transformation into E. coll JM109 . Positive clones 
carrying fragments of the expected size were identified, 
restriction digested and confirmed as correct by 
nucleotide sequencing and comparison with partial sequence 
determined directly from genomic DNA. This revealed that 
three 3 '-terminal nucleotides were removed during the 
15 cloning procedure. 

Nucleotide Sequencing of the OAV287 Genome 

The complete sequence of the OAV2 87 genome was 
determined by sequencing the BamHI fragments A-F using the 
Sanger method (25) and various kits provided by commercial 
20 suppliers. Nested deletions were constructed for the five 
largest fragments using a double-stranded nested deletion 
kit (Pharmacia). These were sequenced using standard 
primers. Based on newly determined sequence other 
nucleotide primers were synthesised using a DNA 
25 synthesizer ( AB1, Model 391). In this way both strands of 
the entire genome and the junctions between the fragments 
were sequenced. 

Mutagenesis of the OAV287 genome 

For the construction of a full length OAV287 clone 

30 and subsequent modification of it to create plasmids such 
as pOAV200 and pOAV600 certain mutations were required. A 
relevant portion of the genome was subcloned into 
Bluescribe (Stratagene, La Jolla, Ca) or a similar plasmid 
which allowed rescue of single stranded DNA. Later it 

35 became possible to use dsDNA for mutagenesis. 

Oligonucleotides of the desired sequence were synthesized. 
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phosphorylated and used as primers as described by the 
manufacturers of Muta-gene Phagemid (Biorad Labs, Ca) or 
Altered sites II (Promega, Wi) mutagenesis kits. 
Mutations were generally identified by digestion with the 
5 appropriate restriction enzyme or by nucleotide 

sequencing, or both. Genome fragments containing 
introduced mutations were subcloned to create larger 
plasmids such as pOAV200 using appropriate unique 
restriction sites. 
10 Construction of a Full-Length Genomic Clone of OAV2 87 
The terminal BamHI A and D fragments (cloned in 
Bluescribe M13+) were each modified by mutagenesis to add 
the nucleotides lost during cloning and a Kpnl site. The 
last base of the Kpnl site incorporated the C at the 5' 
15 end of each genomic ITR sequence. This produced plasmids 
pAK and pDK (Figure 6). 

The left hand approximately 21.5kb of the genome was 
constructed from the BamHI D and B fragments and the SphI 
A fragment of approximately 13kb. The genomic BamHI B 
20 fragment cloned in pUC13 was modified by mutagenesis 

(GCATGC to GCATCC) to remove the SphI site at position 
8287 producing pUC13B. The modified fragment was released 
by BamHI digestion and cloned into pDK which had been cut 
with BamHI and dephosphorylated . Colonies carrying the 
2 5 recombinant plasmid pDBM (Figure 6) were identified by 

screening with an oligonucleotide which spanned the BamHI 
B/D junction. The SphI A fragment (approximately 13kb) 
was cloned into the SphI site of pSELECT (Promega) to form 
pSESPH. This fragment contains a Smal site near its left 
30 hand end which is common to pDBM. The Kpnl /Smal fragment 
from pDBM was subcloned into pSESPH which had also been 
cut with KpnI/Smal to produce pSELLH, a plasmid based on 
pSELECT which now contained the left-hand approximately 
21.5kb of OAV2 87 DNA. 
35 The right-hand end of the genome was constructed 

from pAK which contains the right-hand approximately 8.6kb 
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of the genome and overlaps the SphI A fragment. pAK was 
cut with Sail and ligated with Sail-cut pACYC184, a 
plasmid of 4.24kb which contains a gene encoding 
chloramphenicol (Cm) resistance and an origin for DNA 
replication, to form a pACm (Figure 6). This plasmid was 
cut with SphI and Kpnl to produce the right-hand genomic 
fragment incorporating the P ACYC184 sequences. This was 
ligated with the left-hand KpnI/SphI fragment of 
approximately 21.5kb prepared from pSELLH to produce the 
I final plasmid pOAV287Cm (Figure 6). This plasmid 

replicates stably in E. coli and therefore removes the 
need to propagate the virus to obtain genomic DNA for 
further study. The recombinant genome in plasmid 
P OAV287Cm differs from the wild-type viral genome by the 
5 single point mutation in the SphI site (base 8287), by the 
presence of P ACYC184 sequences in the Sail site and by the 
addition of a GTAC sequence between the ITRs . However, 
insertion of pACYC184 sequences in the Sail site disrupts 
two significant open reading frames whose functions are 
0 unknown. If either of the gene products was essential for 
replication, then pOAV287Cm could not produce infectious 
virus following trans fection. To circumvent this 
potential problem pOAV287Cm was modified further. First, 
plasmid Bluescribe M13- (Stratagene, La Jolla, Ca.) was 
• 5 cut with HindHI and end-filled. The linear plasmid was 
then cut with Smal, blunt-end ligated and transformed. 
The resulting plasmid contained an ampicillin resistance 
gene and origin of replication and lacked Sail and SphI 
sites but retained a unique Kpnl site. This plasmid was 
30 cut with Kpnl and ligated with KpnI-cut pOAV287Cm. 

Plasmids which were doubly resistant to ampicillin and 
chloramphenicol were selected and grown. One of these was 
cut with Sail to release the P ACYC184 sequences, religated 
and transformed. The resulting plasmid pOAVlOO contained 
35 the AmpR gene and replication Ori inserted in the Kpnl 
site between the ITR's of the genome (Figure 7). This 
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plasmid replicated stably in E. coli strain JM109 when 
maintained in the presence ampicillin (200ug/ml). Large 
quantities of plasmid were grown for transfection studies. 
Trans feet ion of DNA and Virus rescue 

To determine whether the recombinant genomic clone 
was infectious, pOAVlOO was cut with Kpnl to release the 
linear viral genome and DNA was transfected into CSL503 
sheep foetal lung cells using lipof ectamine (GibcoBRL) . 
Solution (A) containing plasmid DNA (2-10ug) and 300ul 
EMEM (containing hepes - glutamine), but lacking foetal 
calf serum (FCS) and solution (B) containing lipof ectamine 
(lOul) + 300ul EMEM (containing hepes + glutamine), but 
lacking FCS were combined, mixed gently and incubated for 
45 minutes at room temperature. Subconfluent CSL503 cells 
15 in a 60mm petri dish were rinsed with 3ml EMEM (plus hepes 
and glutamine) lacking FCS. EMEM (plus" hepes and 
glutamine) but lacking FCS (2.4ml) was added to the 
mixture of solutions A and B, mixed gently and added to 
the rinsed CSL503 cells. Cells were incubated for 5 hours 
20 at 37°C in 5% C0 2 • The incubation medium was changed 
using complete EMEM plus FCS (10%) and cells were 
incubated at 37°C in 5% C0 2 until virus plaques or 
cytopathic effect was visible (7-15 days). 

To confirm that viruses rescued from transfection of 
25 pOAVlOO and pOAV200 were derived from those plasmids a 
portion of the genome of wild-type OAV287, OAV100 and 
OAV200 viruses was amplified by PCR. For OAV100 a primer 
pair spanning the region of the mutated SphI site at bases 
8287-8292 was used. For OAV200 the primer pair spanned 
30 the insertion site for the Apal/NotI poly linker between 
the pVIII and fiber genes. Wild-type OAV287 DNA was 
amplified as a control in each case. DNA amplified from 
wild-type OAV287 was cut with SphI whereas the DNA 
amplified from OAV100 was not (Figure 8A) . Similarly 
35 OAV200 DNA was cut with Apal , EcoRV and NotI whereas 
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OAV287 DNA was not (Figure 8B) . Other viruses were 
similarly characterised by restriction enzyme digestion. 
Identification of MLP/TLS elements and Construction of pMT 

OAV287 TLS elements were identified as follows and 
as described (17). mRNAs present in OAV2 87 -infected 
CSL503 cells were copied into cDNA by reverse 
transcription using primers complementary to the Ilia or 
fiber genes. A primer thought to fall within TLS exon 1 
was then paired with each cDNA primer for PCR. DNA was 
successfully amplified, cloned and sequenced. This 
identified TLS exons 2 and 3 (which correspond to bases 
8083-8145 and 8350-8412 of Figure 1, respectively) and the 
3' boundary of TLS exon 1 which occurs at base 5044 of 
Figure 1 . A second PCR strategy was then used to obtain 
15 MLP and TLS fragments suitable for assembly into pMT. The 
region in Figure 1 between nucleotides 4861 and 5023, 
thought to contain the MLP was amplified by PCR using a 
plus sense primer which added an Apal sequence at the 5' 
end and a 3' minus sense primer which introduced an Ndel 
20 site by point mutation at base 5012. Similarly, the TLS 
was amplified using a plus sense primer which introduced 
the Ndel site at base 5012 and a minus sense primer which 
was complementary to bases 8396-8412 and which added a 
Hindi I I site at the 3' end of the PCR product. The PCR 
25 fragments were digested with Apal/Ndel and Ndel/Hindlll , 

respectively and the fragments were cloned into Bluescript 
SK+ (Stratagene) cut with Apal/Hindl II . The resulting 
plasmid was then digested with Hindlll/NotI and a 
synthetic oligonucleotide with Hindlll/NotI termini and 
30 the sequence shown in Figure 9 was cloned to produce 
plasmid pMT . Genes of interest were then cloned into 
convenient restriction sites in the NCS. Gene expression 
cassettes were subcloned as Apal/NotI fragments into 
pOAV200 or rescued into infectious virus. 
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Infection of cells and expression of antigens 

CSL503 and other cells were infected with viruses at 
a multiplicity of infection of 20pfu/cell as described 
previously (21). Infection was allowed to proceed for 24- 
60 nr. Cells were then incubated in methionine-f ree 
medium in the presence of 35 s _ me thionine to label newly 
synthesized proteins. The protein of interest was 
recovered from cell lysates by immunoprecipitation using a 
specific antiserum against the expressed protein (21). 
Recovered proteins were analysed by polyacrylamide gel 
electrophoresis and detected by autoradiography or using a 
phophorimager (Molecular Dynamics). 
RESULTS 

To characterise the genome in molecular terms, BamHI 
15 restriction fragments representing the entire OAV287 

genome were cloned into various plasmids and sequenced 
using methods described in Sambrook (9) and similar 
publications. Sequences were determined on both strands 
by using nested sets of deletion mutants together with 
20 synthetic oligonucleotide primers which were synthesized 
from newly determined sequences . 

The viral sequence of 29,544 nucleotides (Figure 1) 
is considerably shorter (by approximately 6.5kb) than the 
sequence for human adenoviruses but many genes encoding 
25 structural proteins are identified by their homology with 
their Ad2 homologues (Figure 2). It is clear, however, 
that the ovine adenovirus genome shows major structural 
and sequence variations compared with all other 
adenoviruses studied to date (Figure 2), in the regions 
30 encoding both structural and non-structural proteins. In 
particular, 

(a) the reading frames tentatively identified as 
forming the E1A/B regions are named principally on the 
basis of their location in the genome. Very limited 
35 homology can be detected between the 44.5kD open reading 
frame (orf) and the large T E1B protein of other 
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adenoviruses. Homology in the putative E1A region of 
OAV287 has not so far been detected; 

(b) in other adenoviruses the E4 region is normally 
located at the right-hand end of the genome. The OAV287 
E4? region is tentatively identified based only on the 
presence of a protein sequence motif HCHC . . PGSLQC which is 
found in 18.8kD and 30.85kD orfs in this region. 
Identical or very similar motifs are found in the E4 34kD 
protein of human Ad2 and Ad40 and mouse adenoviruses; 

(c) the distance between the end of pVIIl and the 
beginning of fiber, which in other viruses defines the E3 
region, is only 197 nucleotides (Figure 4). The E3 region 
equivalent, if it exists in ovine adenovirus, may consist 
of the cluster of open reading frames which are present in 

15 the right to left orientation on the complementary DNA 

strand, at the right-hand end of the genome (Figures 2 and 
5). However, these sequences show no detectable homology 
with any other adenovirus and the functions of these 
proteins cannot be deduced from such comparisons; 
2 0 (d) there is a region of approximately lkb which 

lies between E3? and E4? which has a very high A/T content 
(70.2%) (Figure 1). As there are no open reading frames 
encoding greater than approximately 30 amino acids in 
length on either DNA strand it is unlikely that the region 
2 5 codes for any proteins, unless mRNAs are generated by very 
complex splicing events. This region has no known 
equivalent in any other adenovirus; 

(e) other differences are apparent in the structural 
proteins of the virus. OAV287 lacks homologues of Ad2 
30 proteins V and IX. However, OAV2 87 has a completely new 
gene coding for p28kD which is located on the 
complementary strand of the E1A? region (Figure 2 and 3). 
This is a structural protein with an apparent size of 28kD 
by SDS PAGE which, according to N-terminal sequencing 
35 data, is cleaved from a larger precursor. No homology 
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between this protein and others in the databases has been 
detected; 

(f) in most other genomes the VA RNA genes are 
located between the Terminal protein and the 52 /55k genes. 
5 In OAV2 87 there is no room for them as the reading frames 
overlap . 

These differences serve to emphasize the unique 
character of the OAV287 isolate compared with other human 
and animal adenoviruses. In addition, since the OAV287 

10 non-structural regions show little or no homology with 
equivalent regions in other adenoviruses, sequence 
comparisons do not reveal the identity of likely non- 
essential regions of the genome. Moreover the viral DNA 
cannot easily be manipulated to test for dispensable 

15 sequences. 

The present inventors have produced a plasmid 
containing a full length infectious copy of an ovine 
adenovirus genome in which the ITR sequences are linked by 
a short sequence which creates a unique restriction enzyme 

20 site. A plasmid containing a full length infectious copy 
of an ovine adenovirus genome linked to a bacterial origin 
for DNA replication and a marker gene has been produced. 
Partial clones of OAV2 87 genomic DNA were specifically 
modified and initially linked to a gene encoding 

25 antibiotic resistance and origin of replication inserted 
into the unique Sail site of the genome (Figure 6 and see 
Methods). Such a plasmid can be grown in bacteria and 
more easily manipulated. 

The circular genomic clone differs from the 

30 naturally occurring circles that occur in Ad5-infected 

cells (10) and that might exist in OAV287-inf ected cells 
in that the 4 0 base pair ITRs are joined by a GTAC linker. 
Together with the last and first nucleotides of the genome 
(G and C, respectively, see Figure 1), this sequence forms 

35 a unique Kpnl site ( GGTACC ) when the ITRs are joined head 
to tail. Other sites such as EcoRI , BamHI , Sail, KasI etc 
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which have recognition sequences beginning with G and 
ending with C are suitable if they are unique as the 3' 
and 5' terminal nucleotides of other adenovirus genomes 
are G and C, respectively. A plasmid with a suitable 
antibiotic resistance gene e.g. ampR and origin of 
replication can be inserted at the unique site or 
elsewhere in the genome to form a plasmid which can be 
propagated in bacteria. Plasmids propagated in the 
presence of 200 ^lg/ml ampicillin in E.coli strains JM109 
and DH5-alpha retain the Kpnl sites and inserted 
sequences, indicating that the OAV287 ITR sequences are 
stable when linked in this manner. This approach may 
therefore be used to engineer other adenovirus genomes. 
If desired the GTAC linker sequence can be removed and the 
15 authentic termini regenerated prior to transfection by 
digestion with Kpnl (or another appropriate enzyme) and 
incubation with T4 DNA polymerase to create blunt ends 

(9). 

A method for generating linear infectious genomes 
2 0 from circular plasmids involved digesting the circular 
plasmid containing the full length copy of the OAV287 
genome with restriction enzyme Kpnl to generate a genome 
with the authentic 5' nucleotide dCMP . The linear DNA is 
then introduced into CSL503 cells using lipof ectamine as 
25 the transfecting reagent. 

To develop a viral genome as a vector it is 
essential to identify region(s) of the genome which are 
non-essential for function. These regions can be then 
substituted or deleted to make room for foreign DNA (11, 
30 12), or they may be the site for insertion of foreign DNA. 

In the human adenovirus genome DNA has been substituted or 
inserted into the El and E3 regions (13, 14, 15) and at 
the extreme right-hand end of the genome between E4 and 
ITR, usually with the concomitant deletion of non- 
35 essential regions to facilitate packaging of the genome 

(16). Adenoviruses will package genomes up to ""6% larger 
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than the wild-type, probably due to physical constraints 
dictated by the capsid structure (11). 

Non-essential sites in the OAV287 genome were 
identified by insertion of a polylinker sequence 
5 containing Apal and NotI restriction sites. This linker 
was introduced into the genome copy in pOAVlOO between 
nucleotides 22,139 and 22,130 of Figure 1 by site directed 
mutagenesis to create plasmid pOAV200 (Figure 7). This 
corresponds to a site located in the intergenic region 
10 between genes for the pVIII and fiber proteins which was 
chosen because it avoids disruption of RNA processing 
signals in the region. A transcription termination site 
for the L4 family of RNAs maps 26 nucleotides upstream and 

the splice junction between 

15 the tripartite leader sequences and fiber mRNA maps 144 

nucleotides downstream of the insertion site/ respectively 
(17). Transfection of pOAV200 into CSL503 cells resulted 
in the rescue of virus OAV200. The second site at which 
the polylinker was inserted was located between bases 

20 26,645 and 26,646 of Figure 1. This created plasmid 

pOAV600 (Figure 7). This insertion site corresponds to 
the right hand end of the A/T-rich region (Figure 2) whose 
function and precise boundaries are unknown. The site was 
chosen as it is six nucleotides to the left of the 

25 transcription termination point for RNAs transcribed from 
right to left from the E3? region (Figure 2). This was 
determined by sequencing cloned RT-PCR-amplif ied cDNAs 
derived from the region using methods similar to those 
described for the pVIII/fiber region (17). Transfection 

30 of pOAV600 into CSL503 cells yielded virus OAV600. 

The above insertion strategy identified two regions 
of the genome which can be interrupted and created sites 
for subcloning gene expression cassettes. 

A further non-essential site was identified using 

35 the unique Sail site located at bases 28644-28649 . of 
Figure 1. The site was cut with Sail, end-filled and 
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religated to disrupt the reading frames which spanned the 
site. A plasmid pOAV600S (Figure 7), which had lost the 
site was identified by digestion with Sail. When pOAV600S 
was transfected into CSL503 cells, virus OAV600S was 
5 recovered. The loss of the Sail site in this virus was 

confirmed by digesting the viral genome with Sail. As the 
Sail site falls within two significant open reading frames 
(which extend on the complementary strand between bases 
28457 and 29014 and between 28511 and 28699), which were 
disrupted by end-filling and religation, the gene products 
derived from the reading frames are probably also 
dispensable. This group of reading frames may therefore 
constitute the E3 region of OAV287 as no other gene 
products in any adenovirus are dispensable for 
replication, in vitro. This implies that it should be 
possible to delete the whole region labelled as E37 in 
Figure 2. In addition, in other experiments a lkb Ndel 
fragment was deleted from the region marked as E4? in 
Figure 2 . This deletion disrupted several reading frames 
in the region. No virus has been rescued from a such a 
plasmid, suggesting that it is not dispensable and 
accordingly, it may be E4 . 

Many viruses replicate incompletely in heterologous 
hosts, often entering cells but being unable to produce 
25 mature virus particles because of a block in the 

replication cycle. In the context of recombinant viral 
vectors, this represents a desirable safety feature, 
provided that replication is not blocked before 
appropriate and effective expression of the foreign gene 
30 occurs. OAV287 does not replicate productively in 

heterologous cell types (18), the only exception so far 
being bovine nasal turbinate cells in which viral titres 
are significantly reduced compared with the CSL503 cells. 
Recombinant forms of OAV287 have been constructed to 
35 determine whether expression of a reporter gene under the 
control of an appropriate promoter occurs. 
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Foreign gene expression requires that the gene be 
functionally linked to a promoter. This may be a viral 
promoter inherent in the genome, or a foreign promoter 
subcloned together with the gene of interest into a 
5 suitable site. The promoter driving gene expression must 
function in CSL503 and preferably a range of other cell 
types. In this work an OAV2B7 genomic promoter was used 
initially. Subsequently an heterologous promoter was also 
used. In adenoviruses, expression of the structural 
10 proteins is driven by the major late promoter (MLP). 

Families of RNA transcripts derived from the MLP contain a 
common sequence element, the tripartite leader sequence 
(TLS) at their 5' ends. The present inventors have 
identified those nucleotides in the OAV2 87 genome which 
15 comprise the TLS by using RT-PCR amplification of late 
mRHA transcripts present in OAV287-inf ected cells and 
sequencing of cloned cDNAs (17). A candidate MLP was 
expected to be present just to the left of TLS exon 1 
(Figure 2). The MLP and TLS elements were subcloned using 
2 0 PGR techniques into a separate plasmid pMT (Figure 9) and 
linked with genes of interest. These promoter /gene 
cassettes were subcloned as Apal/NotI fragments into the 
polylinker Apal/NotI sites of pOAV200. Using this 
strategy plasmids pOAV203, pOAV204, pOAV205 and pOAV210 
25 were constructed. These incorporate genes encoding a 17kD 
soluble protein from T. colubrlformls, a rotavirus VP7sc 
gene (19), the 45W antigen from Taenia ovls (20) and a 
membrane protein (PM9 5) from Lucllla cuprlna, 
respectively. Plasmid pOAV202, contained the 17kD antigen 
30 but lacked the MLP/TLS elements. These plasmids were 
transfected into CSL503 cells and rescued as viruses 
OAV202, OAV203, OAV204, OAV205 and OAV210, respectively 
(Figure 10) . 

The human cytomegalovirus immediate early IE94 
35 promoter plus enhancer, which functions in a range of 

human and animal cell types (21), was also linked to the 
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rotavirus VP7sc antigen gene. This cassette was assembled 
by replacing the MLP/TLS elements in pMT/VP7sc with the 
HCMV enhancer-promoter region. The cassette was inserted 
in P OAV200 to create pOAV206. pOAV206 was transfected 
into CSL503 cells and virus OAV206 was rescued (Figure 
10) . 

CSL503 and other cells were infected with the 
viruses described above and at various times post- 
infection the cells were radiolabeled with 35 s _ 
methionine. Proteins of interest were recovered from cell 
lysates by immunoprecipitation using an appropriate 
antiserum. Recovered proteins were analysed by 
polyacrylamide gel electrophoresis and detected by 

autoradiography. 
15 when virus OAV202 was used, no expression of the T. 

coulbriformls 17kD antigen was observed by 
immunofluorescence. As this virus lacks the MLP/TLS 
elements and carries only the 17kD gene this result 
demonstrates that there is no viral promoter upstream or 
adjacent to the insertion point between the pVIII and 
fiber genes which is capable of driving gene expression. 
As the E3 region is also missing from this site there is 
no requirement for a nearby promoter. This situation 
contrasts with observations made using a human Ad5 E3 
recombinant (21). In this case a promoterless gene 
inserted 3' proximal to the pVIII gene was expressed, 
probably from the adjacent E3 promoter or the upstream MLP 
(15, 21). This result further emphasizes the unique 
nature of the OAV287 genome. Recombinant OAV287 viruses 
carrying the MLP/TLS elements were tested for expression 
in CSL503 cells. With OAV204, expression was easily 
detected in infected, but not in uninfected cells at 24hr 
post-infection (Figure 11A). Similarly, when viruses 
OAV205, and OAV210 were tested, gene products of 24kD and 
35 approximately 95kD, respectively were detected (Figure 

11B). Therefore it is clear that MLP/TLS elements contain 
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the necessary information to drive gene expression in the 
homologous cell line under replication-permissive 
conditions. However, when OAV204 was tested in a 
heterologous rabbit kidney cell line in which the virus 
5 does not replicate productively, no VP7sc expression was 
observed. Some replication occurs in bovine nasal 
turbinate cells, although to a lower titre than in CSL503 
cells. In the latter cells, expression of VP7sc was 
detected following infection with OAV204 (Figure 11B) . 
10 Virus OAV206 containing the HCMV enhancer /promoter 

element linked to the VP7sc gene was used to examine the 
function of a heterologous promoter in the context of the 
OAV287 genome. CSL503 cells infected with this virus 
readily expressed VP7sc antigen at 24-48hr post infection 
15 (Figure 12A) . With this virus VP7sc expression was also 

observed in the non-permissive rabbit kidney cell line and 
in bovine nasal turbinate cells (Figure 12B) . These 
results suggest that the HCMV or a similar constitutive 
promoter may be preferred over the MLP to drive gene 
20 expression in OAV recombinants in non-permissive cells. 

One recombinant virus was also administered to 
sheep. Five sheep were vaccinated intracon junctivally and 
intranasally with 0.7xl0 8 pfu of OAV203. At three days 
post-inoculation virus was recovered from the nasal swab 
2S of one sheep and from the conjunctival swabs of two sheep 
and confirmed as the recombinant virus by PCR analysis. 
Animals showed no obvious- ill effects from such 
vaccination. 

The viral vectors of the present invention can be 
30 used for the delivery and expression of therapeutic genes 
in grazing animals. In species which are not normally 
infected by ovine adenoviruses the lack of pre-existing 
immunity should allow efficient infection, gene delivery 
and expression. The genes may encode vaccine antigens, 
35 molecules which promote growth in production animals, 

molecules which modify production traits by manipulating 
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hormone responses and other biologically active or 
therapeutic molecules. The virus does not replicate 
productively in many non-ovine cells but the use of 
heterologous promoters allows the delivery and expression 
5 of genes while minimising the possibility of virus spread 
to a non-target host. As the DNA of adenovirus vectors 
can persist in cells in an unintegrated form, with the 
appropriate choice of promoter, expression over a 
prolonged period can be achieved. 
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CLAIMS: 

1. An isolated DNA molecule comprising, a nucleotide 
sequence encoding the genome of ovine adenovirus (OAV287) 
substantially as shown in Figure 1 or a functionally 

5 equivalent nucleic acid sequence. 

2. The DNA molecule as claimed in claim 1 such that the 
nucleic acid sequence encoding the genome of the ovine 
adenovirus is substantially as shown in Figure 1. 

3 An isolated DNA molecule comprising a nucleic acid 

10 sequence encoding the genome of ovine adenovirus (OAV287) 
substantially as shown in Figure 1 wherein a portion of 
the adenoviral genome not essential for the maintenance or 
viability of the native adenovirus is deleted or altered. 

4. An isolated DNA molecule comprising at least a 15 
15 nucleic acid base sequence being substantially unique to 

the ovine adenovirus (OAV287) nucleic acid sequence as 
shown in Figure 1. 

5. The DNA molecule as claimed in claim 4 such that the 
at least 15 nucleic acid base sequence encodes a 

20 functional element of ovine adenovirus (OAV287). 

6. The DNA molecule as claimed in claim 5 such that the 
functional element is selected from the group consisting 
of promoter, gene, inverted terminal repeat, viral 
packaging signal and RNA processing signal. 

25 7. The DNA molecule as claimed in claim 6 such that the 
functional element is the inverted terminal repeat having 
the nucleic acid base sequence 1 to 4 6 as shown in Figure 
1 . 

8. A plasmid including the DNA molecule as claimed in 
30 any one of claims 1 to 7 . 

9. A plasmid including the DNA molecule as claimed in 
any one of claims 1 to 3 such that the nucleic acid 
sequence encoding the adenovirus genome or a portion 

• thereof is linked to a nucleic acid sequence encoding an 
35 origin of replication and a further nucleic acid sequence 
encoding a marker. 
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10. The plasmid as. claimed in claim 9 such that nucleic 
acid sequences encoding inverted terminal repeats of the 
adenovirus are joined. 

11. The plasmid as claimed in claim 9 or 10 such that 

5 the nucleic acid sequence encoding the marker encodes for 
resistance to an antimicrobial agent. 

12. A viral vector comprising a DNA molecule including a 
nucleic acid sequence encoding the genome of ovine 
adenovirus (OAV2 87) substantially as shown in Figure 1 or 

10 a functionally equivalent nucleic acid sequence or a 

portion thereof and at least one nucleic acid sequence 
encoding a non-adenoviral polypeptide or polypeptides. 

13. The viral vector as claimed in claim 12 such that 
the nucleic acid sequence encoding the genome of the 

15 adenovirus is substantially as shown in Figure 1. 

14 . A viral vector comprising a DNA molecule including a 
nucleic acid sequence encoding the genome of ovine 
adenovirus (OAV287) substantially as shown in Figure 1 
wherein a portion of the adenoviral genome not essential 

20 for the maintenance or viability of the native adenovirus 
is deleted or altered, and at least one nucleic acid 
sequence encoding a non-adenoviral polypeptide or 
polypeptides . 

15. The viral vector as claimed in any one of claims 12 
25 to 14 such that the nucleic acid sequence encoding the 

polypeptide or polypeptides encodes a polypeptide or 
polypeptides derived from bacteria, viruses, parasites or 
eukaryotes . 

16. The viral vector as claimed in claim 15 such that 

30 non-adenoviral polypeptide is rotavirus VP7sc antigen, the 
parasite polypeptide is Trlchostrongylus colubrlformis 
17kD antigen, the Taenia ovis 45W antigen or the PM95 
antigen from Lucilia cuprina . 

17. A method of delivering a DNA molecule having a 
35 nucleic acid sequence encoding a non-adenoviral 

polypeptide or polypeptides to a target cell, the method 
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comprising infecting . the target cell with a viral vector 
as claimed in any one of claims 12 to 16 such that the DNA 
molecule encoding the polypeptide or polypeptides is 
'expressed and the polypeptide or polypeptides is produced 
5 by the target cell. 

18. A method for delivering a DNA molecule having a 
nucleic acid sequence encoding a non-adenoviral 
polypeptide or polypeptides to an animal, the method 
comprising administering to the animal a viral vector as 
10 claimed in any one of claims 12 to 16, such that the viral 
vector infects at least one cell of the animal and the 
infected cell expresses the DNA molecule encoding the 
polypeptide or polypeptides and produces the polypeptide 
or polypeptides, 
X5 i9. The method as claimed in claim 18 such that the 
animal is a grazing animal. 

20. The method as claimed in claim 19 such that the 
grazing animal is a sheep. 

21. A viral vector comprising a DNA molecule including a 
2 0 nucleic acid sequence encoding the genome of ovine 

adenovirus (OAV287) substantially as shown in Figure 1 or 
a functionally equivalent nucleic acid sequence or a 
portion thereof and at least one nucleic acid sequence 
encoding a functional RNA molecule. 
25 22. The viral vector as claimed in claim 21 such that 
the functional RNA molecule is an antisense RNA molecule 
or ribozyme. 

23. A method for delivering a DNA molecule having a 
nucleic acid sequence encoding a functional RNA molecule 

30 to an animal, the method comprising administering to the 
animal a viral vector as claimed in claim 21 or 22, such 
that the viral vector infects at least one cell of the 
animal and the infected cell expresses the DNA molecule 
encoding the functional RNA molecule and produces the RNA 

35 molecule. 
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Fig 1 

CTATTCATAT ATATAACGTT GCACAGAGGC GGGGCGTGTG GGTTTTTTAT TGTTTATTGT 60 

CATGGAATTT ACAAAGAAGT AAGTTGTTGG ATCTTTATTC ACAATTCTTT TAACAATGAC 120 

TTTTTTACTT ATTACATTTT TCATCTTTTT TACTTCACAT GATATTTTAC TTAAATTTTG 180 

TACATACAAG CCAAAATTCG CATAAAATGT CTTACTTTAA AAAGTTAAAT TTTTTTTTTA 240 

ACGCATAAAT GGACGTACAG CAGCAATTGG AATAGCAGGA AGGGCCATTG TAAAGTGTGT 300 

TCCTGCTGAT GCCGCTGCAG AAAGGATAGA TGCTATCGTA CGCATAAACC CCCCTCCTAT 360 

TTGTTCATCT GCTGCTTTTA TTATATCTTC TGCCAATCTA GGTGATATTT GCTTTTGAAT 420 

GCTGTTTCCA AAAGCTTGCA TCATCGGATT TTCAATTAAA TGGATTGGAT TTGCAGAATT 480 

TCCTTAAAAA TAGCCCAACC CATCTAAAGC AGTTAAAAGT ATTCTCCCTC CAGGAACCAC 540 

AGATATAATT AAGCGGAGCA ACCGAGAGGT TAAATTCCAG GGTCCTCCGA AGAGAGTATC 600 

TAGGATCAGG CCAAGAAGTG AACCAAAAAG ACTTGTAAGT AGAAGTTGTC TGATATGCTT 660 

TGGAGAGGAC TGTTAAAATT GCAAAACGGT ATCTAATGAC CATTTCTTCT TTACTTTTAC 720 

ATCTGTATCA TGTTCTCCAT CAGAAGGTCT TATTGGGAAG TACCATTGGT CACGAGCATC 780 

TTTGAAGACT TCTGTTTCTT GAAATTCTGT TTTCGGTAAG CGACTAGCAG TTATGGTATT 840 

AGGAATATTG ACGGTAATGT TATTCACATC TACAATTTCT GGAGGAATCC ATCTTGCATA 900 

GGATGAAATG GGTTTTGTGG GTTCTTTCAA TATATAATTG CGAGGAGGGT TTTTCCAAAA 960 

TCTCTGAACA TAAGTATTTT CTGATTTTGG CGGTTTTTTG CTTTTTCGCG CTCTTTTTCT 1020 

TGGCTTTGGT CTTTGAAATT TTTTCTTCCT TTTTCTGTAG GCTCCTCCTG CTAAAGCTGT 1080 

GTTATTTGTG ACGTACATCC TGTTAGCTAC ACGATTTTCC CGGACTGCAA ATTTTTTTGC 1140 

CAAATGGAAA AGAAATTGCT GAAACCTTCT ATTAATCATA TAAATTGTCA GTGGAATCAT 1200 

GAATCAGATA GTGCAGGATT TTTTCTTTTT GATACTGATA ATTTATACTA TTATGTATTG 1260 

GATCAAGTGT CTTGGATATG TTTAAGAGAT ATAACTCTTC ATTGTGATCG CATGTGGTTA 1320 

GCGGTTTGTT TTTGTTTGTG CAAATCTAAA TTTGATGTAC ACAATATTCT AGCGGGAGTA 1380 

CATGTTATGT AATGAAAATG ACGTCGGGGA TTGAATGGAT TGAGCCTTAT TTGACATTTT 14 40 

TCTGTGATTT TTTTGCCTTA TTAGGAAATA AATTTGTGGC GCCAGTACGA TGGAGATTGG 1500 

AATGACTCCT GCATTTACAG AAAGGAATTT GTACTGTGTT TTGCTTGACT TTAATTTAAG 1560 

ATGGTATCAG CAGATATTTA ACCCAATATG GATTAAGCCA AATTTATGGG CTTTCTCTGA 1620 

TTTTTTAAAA AAAATGGCCT TTATTTATGC TAGCGACTTG GCGTTGTTAA ATTCTTACAT 1680 

CCCTGGTAAT GTTTGTAACA AACTTGATAT CATCAAGAAA GATCTTCCTG AAGATTTTAC 1740 

CGTGTCTATG TTTTGTGTCT TAGTGTGTTG GCTTGCTTCT TTCTGTAAAG GTTCTAATTT 1800 

AGCTGAAACT CGCCAGAATT GTCACGCGGT AAGCAAATTT CTGGCACAAC TATCAAAATT 1860 

AATAAAACCC TAATTTTTAG TTTGTAAAAA TAGAATTCAA ATTTTTAACG CCACAATGAC 1920 

TTCGGCGGAG TTTTCTGTTG AATTTCCTTA TGTTTCTAAG CCAATTGTTC CATGGCCTGC 1980 

TTCGGCATCT TCTAATAATT CATCGAGTCA GAATATTGAC TTTCCTGTTC TTAAACCAGA 2040 

TCAAGATCCA ATAGCCTTCT TTCAAACTAA CAATACGGCT TACTTACAAC CTGGAGCTAC 2100 

TTATTACTGG AAGTGTATCG AACTGTCAAA GCCTATTCAC ATTTACGGTC AAGGAGCTAC 2160 

AGTACAACTT GTCGGACCTG GACCTGTGTT TGTTTTCAAC AGTGAAAGTG TTATTCCTGA 2220 

AGATTTTTAC GTCGTGTTTG AAAATATCAA CTTTATTGAA GATGAATTTC CTATTAGAAG 228 0 

TGGCCAGTTA AGTTTAGGAC TTACAACTCA CAGTGCTGTA TGGTTTATCA ATGTATGGAA 23 40 

AACTTCAATA GTCAATTGTA ACTTTAAAAA TTTTAGGGGA GCGGCTCTTT GGTATTCAGA 2400 

TAATAGAAAT TTTTGGAATG CGAGAAAATG GAATCAGCAG CATTTAGTTT CAAATTGTCG 2460 

TTTTAATGGT TGTAGAATTG GAATTTCTAA TACTGGTTCA TCTGAATATT CCATAGCCAG 2520 

TCAAAATCAA TTTTATGATT GTCAAATCTG TTTTAATGTA ACCGGGGGTA ATTGGTCTAG 2580 

AAATAATAAT GTTATTGTTA ACTGTAGATG TGCTTATCTG CATGTTGGAG ATAACATGTG 2640 

GTATGAAGGC CATTCCGAAA ATAATAATCC CGCTAAGGGT ACTTTCTGCA ATAACATAAT 2700 

TAACCATGCT GATAACGGAG GCAATGTCTG GCCTACTCAG TTTAAACTTA CAGATGGATC 2760 

AACGATACAG TTAGCATCAT TTTATTTTGA TGATAATCAA GAAATTCCAC CTTGTTATAG 2820 

CGGTAATTTT CATTGGTTTG GAGATGTAAA CATTGTAAAT TTTTCTACCA CAAAAATTGA 2880 

TAAATGGTGC ATTACTGGAT GTAATTTCTA TGGTAATACA CATGCAGCTA ACGATGCTGG 2940 

TCAAGTTCAG GTTGCTGAAG CTGTAAAAGA CAAAGTGTTT ATTATTGGGT GTTCTGGTAA 3000 

TAATGTAACC ATGAAAAATA TTGTAGAAGG TAACATGACT CCAAAAATTG GTACAATAAA 3060 

GTAAAAAACT TTTTATTCAA AACAAAATGG ATTTACATTT AAACGTTTTA CATATTGATT 3120 

CTGCGTATAA GTTCTTTTTC TAAACACTCT TCTAATTTCC ATACATGCTT GATAAAACAA 3180 
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Fig 1 (cont ) 

ACTTTGTAAA TTCATAAATA TAGGTTTGAC TTGATCAGAA GGTGAATAAT AGCTCCATCT 3240 

AAATGATTCG GTAATAGGAA CATTATTATA TATTAACCAG CTATATTTTG AGTTAACTCT 3300 

TGCATGATCC ACTATATCTT TAAGTACAGG GATAAGTGCA CTCGGAAATC CAAAAGAATA 3360 

GTTTTTAATA AATCTATTTA TCTGTGAAGA ATCAAGCTGC GGACTAATAA CATGACATTT 3420 

TGATTGAATT TTTAAATCCT TAATATTTCC TCTATCATGA CGCGGGTTCA TATTATGTAA 3480 

AACTACTACA ACAGTGTAAC CATTACATTT GGCAAATCTA TTAAAAATTT TTGACGGTAA 354 0 

AGCATGAAAG AAAGAACTTA TAGAATGACA TGATCCCAAT TGATTCATAC ATTCATCTAT 3600 

TATAATACAG ATAGATCCTT CACTTGCAGC TCTGCAGAAT ATATTATCTG GATTATCAAT 3660 

ATTTAGATTA GTATCGGAAA TAGCATCTTT GAAAGCTAAT TGTATAAATT TTGGATTTAA 3720 

TGTTTTTGTT AG TG GAT TAG AGAATGCATC GTAGTTTCCT TCAACACACT GTGCTTTCCA 3780 

CGCAATTTTT TCTTCTAATG GAACAGTACC TTTTTCTGGA GTTATGAAAA AAATTGTTTC 3840 

TGGTATTGGA TCAATTAGTT TTCCAGATAT AATATTTCTT ATAAATTGAG ATTTTCCGCT 3900 

ACCTGTGGGT CCATATACAG TAACAATGAA TGGTTGTAAT CCGCAGTTTA AACTGGGTAT 3960 

ACAGCCATCT TTTAACAGAT TGTGAGCCTC ATTTACAGTT TTTTGATAAT TTACAGCAAT 4020 

ATTGTGTAAA TCAGTCATAA GTTGACCATG ATACATACAT TTATCAAAAA CTTCTTGACT 4080 

TTCTGGAAAT GGATTTCTGC AAATAGAAGG ATCTATCTTT ACAACATCAT TTTTCCAATT 414 0 

TAATGTGTCA CTTAAAAATT TTCCCAAAAA GGATTTTCTG TCAATGGTTC TTGCGGTCTT 4200 

GGATTTGGGT GTCTCTTGTC GTACGGGTAA AGTAAGTATC CTTTCTTCCA CTGGATCCCT 4260 

TTCCTCATCG TTTGATCCTT CCAAGGTCTC AGAATTCTGG TTAGTTGCTT CTCTACCACC 4320 

GTGAATGGTA CATCGGTTCC ACTTGCGGTT TGCAGTGTCT TTTTTAAACT TTTCCTCGAT 4380 

GTCTGAAACT CTTTCTGTGG TTGTTCTAAT AAATTATAGT CAGTAAAACA ATGTTTTAGA 4440 

ATTTCATAGT TTAAACAATT TTTAGCATGA CCTTTGGCTC TTAATTTTCC TTCTCCAATA 4500 

AATTTACAGT TTTTACAAGT TATGTCTTTT AAAGCATATA ATTTAGGAGC TAAAATACAT 4560 

GTTTCTGAAC TGAATGCTTC AGCTCCGCAA CGGTTACAAA CAGTTTCGCA TTCAACCAAC 4620 

CAAGTTAGAC ATGGATGTTT TTCATCAAAG ATTAAATTTG AGTTATATTT TTTAAGTCTA 4680 

TGTAATCCTT TTGATAACAT GAGTTGGTGG CCCTTTTCTG TTAAGAATAA CGAGTCTGTA 4740 

TCACCATAAA TACTTTTTAT CTCCCTTTCT ATGTAAGGTT TACCCATATC TTCCCCATAT 4800 

AAAATTTCTG CCCACTCACT CATGAAAGCT CTGGTCCAAG CCAGCACAAA GGATGCTATC 4860 

TGAGTTGGAT ATCGGTTGTT CTTGATCCAT TCTTCCTTAT CCTCAATAGT TGTTAAAATT 4920 

AAATCATTAC AATCAGCAGA TAAAAAAGTT ATAGGCTTAA AAGTCACGTG ATCTTGATTT 4980 

CCTATAAAAA GTGGAAAATT AAAATTTTCA TTTGTGTCTT TGGAATCTTT GGGCGGCATT 5040 

TCAGGTAGGT TTGAAAAATA CTGATTCCAC TCAAATGAAC GTTTTGGTAA TGATTTACTA 5100 

ATCACAGTTG TGTATGATGT AATTTCAGCT GATCCATTTT CTAATCTTTT TTTATCTTTC 5160 

TCTTCAATAT TTTCAGCAAA CACTACTTTC TTTTTATCTA TACGGGTAGC AAACGAACCA 5220 

TATAAAGCAT TTGATAACAA . TTTACTTATA CTTCGCTGAA TCTTGTTGTT ACTTTTACTT .5280 

GCTTTTTCTT TAGCCATAAT ATTTACTTTC ACATATTTTT GACATAACGG TTTCCAGTCA 53 40 

CTCCATACAG CATACATTTC AGAGCTTTTG ATTATTTTGC ATTTCCATCC TCTATTGTGT 5400 

AAGGTGATTA AATCGATAGA GGTCAGTACT TCATTTATCA ATGTTTCATT TGACCAGCAT 5460 

AACTTTCCAC TTTTTTTAGA ACATAATGGA GGTAACACAT CAAGATAATC TAATGATGGG 5520 

GGTTCACAAT CGGCTACCAC AATCATAGGT TTGATTGAAT TGTCAAAATA ATCTATTTTT 5580 

TCTTTTCTTT GTAGTAGTTC TTGAAAGTAA TCTATTTGTG CATTGGCTTC AAAAGCATTT 5640 

AAAGTTTTTC CATATGGAAG TGGATGCGTT AAGGCACTAG CATACATTCC GCAGATATCA 5700 

TACACATATA TTGCTTCTTC AAATATTCCT AAAAATGAAG GATAACATCT TCCTCCTCTT 5760 

AAACTCATTC TAACAAAATC ATACATTTTT TCTGATGGAG CTTCCAAATT TCTTAGGAAT 5820 

TCAGAGGGAT GATCTTCTTC ATTATAAAAG ATTTGTTTAA ACAATGCTTG AGTATTACTA 5880 

CTAATTGTAG GACGTTGGAA TATATTAAAA GAACACTCAA GCTTTAAAGA TGTTGTACAG 5940 

AACTCTTGAT AACCTTCTAT AAGTTTTTCA ACTAATTGAG CCGTAACTAT AACATCATCA 6000 

ATACAATACT CCTTAGCTTC CTCTAATAAG TTGTATTTTT GGTTGTGTTT TGGTTTGTTT 6060 

TGTAAATATT CTTCAAATGA ATTCCAATAT TTTTGAACTG GATAACCATT GTTTTCTTTT 6120 

TCATATTCTC CCAACATAAA AAAATCATTG ATTGCCCTGT AAGGACAATA ACCTTTGCTA 6180 

ACACTCAACT GATATGCAGT AGCAGCGTCT CTTAAAGAAG AGTGGGTTAA CAAAAATGTA 6240 

TCCCTAACCA TAAATTTTAT ACCTTGCCAT TTCATATCTT CAAAATTAAT AATTCCATTT 6300 

TTCCATCTTT CATAAGTTGT ATGTGAAGGT TTCTTAAAGC AAGGATTTGG AAGAGATAAT 6360 
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Fig 1 (cont) 
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CGAGGCATAA 
AATACATGAC 
AATTCAATAT 
TCTTCATATG 
TTAGCTACTG 
TCTTTTTTAT 
ATTTTAGAAG 
GGAGTTAACT 
AAAAGACGTT 
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AAATCTTCTG 
CCGGATTTTG 
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Fig 1 (cont) 
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Fig 1 (cont) 



TAGAAAAAAT 
GTTATAAAAA 
GAGGATTAGG 
CTAACAATAC 
AAGCAGGGTT 
ATACAACTCA 
GCACCGTAAC 
TAAAAAACAG 
GTCCATACAT 
TTCAGTAACA 
TTAGTTAGTC 
GGCTTGAAAT 
GGACAGCTTA 
AAATTGTACA 
ACATCTCCTG 
AATAGAGCTA 
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TAATCTTGGA 
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ATATTTTGAC 
AGCrTATAAT 
TGAAGCTACA 
TGTAGAAGAA 
ATATGCTACT 
TTTAGGAATT 
ATCAGCAGAT 
TACAGATGAC 
TGCTCATTAT 
TTATATTGGT 
AGGAACATTT 
CAGTGAACTA 
ACTTTGGAAC 
ATATGAAGAA 
GCCTACTGCG 
AGATAACACC 
AGCAAATCTA 



AAGTGACAAT 
TAATGGAGGA 
TCAAATTTAT 
TACAAAGCCA 
AGTTCATAAT 
AGTATTCAAT 
ATGGATAAGT 
CCTTACAGGT 
ACAGAAATCT 
ATCTGGCTGA 
CCTCTGATAA 
TTTCTAAAAA 
ATGCTCGTAC 
GGGACAGACT 
CGGATCGACT 
GAAGAGCAGC 
ATCTCCTCAT 
TCCATACATT 
AGCAGCCGCA 
TAATCGTTCT 
TAGCTCCCCA 
GAGGAGGTTT 
GTTCATTTAT 
AACAAGGTCT 
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AAAAAGTTAT 
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AGATTTACTT 
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ATTGAATATA 
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AGCTATCAAT 
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GCAGGTAATG 
AAGGCTTTTA 
CAACGTACAT 



CCCCCCAGAT 
GCTAAAGCCC 
TGGTCAATGC 
GAAACACTTC 
ATAAATGCGG 
AGATTTCCTA 
GAAAATGTCC 
GTACAAAGAG 
TTGGCGACTG 
TATCTCTGGG 
TACCGGGTGG 
ACAACCTGTT 
TTCACTTGAG 
AAAAAGAAAA 
TAAAAAATAT 
CCAATAAATA 
CGTCGAAGAC 
TTACCTAAAG 
ATTAGCGCAG 
TAAAATTTAG 
TTGCGGCTTA 
CAGCTGGTCT 
ATCAAATACT 
ACTGCAATCA 
GGTAGATATT 
AGGGAACGAC 
AAAAGATGAA 
TAAATCTAGC 
CATAAAAACC 
TCACCCAAGA 
AGATGGAGTA 
TGGAGCCACA 
AAAATCTGGT 
GAGATCCTTT 
TTCGTATAGT 
TAAATGTAGG 
TTATTGATAG 
CAAAATCAGC 
TTGCTCAACT 
TAGAAGCAAA 
CCAAATTTAA 
GAGATCTTTT 
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CTGATGAAGC 
ATTTTATTGG 
CACAACAACT 
ATCTAATAGC 
ATAGTTACGA 
CCTTATCATT 
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TAGGATATGG 
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GGCAAACAAA 
TAACTGTACT 
CAGATACTTT 
CAATTGTTGG 
TTTATTCTCA 
AAAATGCTAT 
CCTTTGTTGC 
TTACTATAAC 
TTGTCCCTAA 
CCTTATCCTC 
GGTATTGGAA 
AGAGTGCGAC 
AAACTAAAAA 
ACAGTTGTTC 
CTTAAAGCTG 
TTATTTTTCA 
ATACACGTCG 
AATTGCAAGG 
CCCCTGCTAT 
AAAACTTTTT 
ACACCTGTTT 
AGTTTGGGAA 
GCTCAAAAAA 
AATGTTTTAG 
GGAAGATTAA 
CAACAAATTA 
ATGTTTGTAA 
CAAATGCCTG 
AAAGAAGTTA 
AAAAGAAAAC 
AATTTTAATA 
GCGTGAATTT 
ACAATTCATC 
TGTAGCTCCA 
TCCGATTCAA 
AGATAACAGA 
AGGACCTACT 
TTTTCCCAAT 
CCCTAATGCT 
CACTACTACT 
TCCAGAAACT 
TCCGGCTTAT 
CATAACTAAA 
AGTTGACACC 
CAAAGCTACA 
ACTCATGTTC 
TAATGTTGTT 
AGATCTGACA 
CCAGTATGTC 
TCCTTCTCAA 
AGACACGGGT 
CAACATGCCA 
TAATGTAGCA 



GTACAGATCT 
AACTGTTCCG 
TAAAGCACCT 
ATTACATATG 
ACTTTTGGAA 
ACTAATGCAA 
AGATCACGGG 
AGACGACAGA 
AGTACTTTCA 
CTGGAACCGT 
CATCAAGTAT 
CTTATTACAG 
CCAAATTGAA 
CAAAGAAAAA 
TCAGTCAAAT 
CTTGCAGATG 
TTACAAAAAA 
CGGTTTTTTA 
AGCTGGAACT 
TTTTAACAGA 
ATGGCCACAC 
ATTCTTTTAC 
TAGGTCAATC 
AAAATGCAGG 
AGGTAGAGAA 
CTCAAGAACA 
AGCAATCAGA 
TAGAAATGTC 
GCCCTCCTTC 
GCGTATCCGG 
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TTTCACATTG 
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TCGACGGGTG 
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GTTGCAGATC 
TTTAAACCTT 
GCAGCTTTTA 
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GTCTATCTAA 
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TACAATAATG 
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GATAGGTATA 
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GGTATCCAAA 
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TGGACTTTAA 
GACATAACAG 
ATTACTTTTA 
TTTCCTTTAA 
CAAATTACAA 
CCACCTTACA 
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AGGAGATCTT 
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GAGGGCTACT 
AGCTCAGTGG 
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GAGGTCACCT 
CAAAGCTTTC 
AAGGTAGTTC 
CTAAAAAAAA 
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GTAATAGCTG 
TCACATGGCT 
CGTTGGAATC 
TTCTGGTTTA 
ACAAGGATTT 
ACAATTAGCA 
AGATCTAGAA 
ATTAGCTCAA 
AAAAATTGTT 
TTTTTATGAT 
ATTTTCATCT 
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TTGTTATTAA 
CGGGTAGAAA 
AAAGTTTTTT 
TAACTACTGA 
ATGAAAACTT 
TTGGAAGTGC 
ATGGAGGGAC 
TGGATACTGA 
AAAACAAAGG 
AATCAGGAGA 
CTGGAAGGCT 
GTAGACCTCA 
ACACTACTGC 
TGCATCCAGA 
ATCGCCCAAA 
GTTCTAATGC 
ATGACAGAAA 
GATATTTTGC 
ATAATGAAGG 
ATTATTTCAT 
CAGCAAAAAC 
TGAATCTGAC 
CAGATAGGCT 
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Fig 1 (cont) 

GAAAACAACA CCACCCAACA TAAATCTACC TGATGACACC AACTCTTACG GATATATAAA 15960 
TGGAAGGGTC CCTCTAGCAA ACATAATAGA TACATGGACT AACATTGGGG CTAGGTGGTC 16020 
ATTAGATGTT ATGGATACTG TAAATCCATT TAATCACCAC AGAAATTCAG GACTAAAGTA 16080 
TAGGTCACAA CTGTTAGGAA ATGGAAGATA TTGCAGATTT CACATTCAAG TACCTCAAAA 16140 
ATTTTTTCCT ATAAAAAATC TTTTGTTGCT GCCAGGAACA TATAATTATG AATGGTACTT 16200 
TAGAAAGGAT CCCAACATGG TTTTTCAGTC TACTTTAGGT AACGACCTTA GAGCAGATGG 16260 
CGCAACTATT ACATACACCA ACATAAATTT ATATGTTTCA TTTTTCCCTA TGAATTATGA 16320 
AACAGTAAGT GAACTTGAAT TGATGTTGCG TAATGCTACT AATGATCAAA ACTTTGCAGA 16380 
TTATTTGGGT GCGGTAACTA ATCTTTATCA AATCCCAGCT AATACAAATA CTGTAGTAGT 16440 
GAACGTACCA GATAGATCTT GGGGTGCTTT CAGAGGATGG AGTTTCAATA GAATTAAAGC 16500 
TTCAGAAACA CCTATGATAG GAGCAACAAA AGATCCAAAT TTTACTTATT CAGGATCTAT 16560 
ACCGCTACTA GATGGTACTT TCTATTTAAC ACACACTTTT CAACGAGTTT CTATTCAGTG 16620 
GGATTCTAGC GTTCCATGGC CAGGAGATGA TAGGCTTTTG ATTCCAAATT GGTTTGAAAT 16680 
TAAGAGAGAT CCTAATATGG ACGCAGAAGG TTATACTATG AGTCAAAGTA CTATCACAAA 16740 
AGATTTTTAT TTGGTACAAA TGGCTGCTAA TTATAATCAA GCTTATCAAG GTTATAAATT 16800 
GCCAGTACAT TCTAAATATT ATGGATTTTT AGAAAATTTT CAACCTATGA GTCGCCAAGT 16860 
ACCAATTTAT GGTAATGGCA CTTATGATTT ATATACTGCT TATATTACAA ACCAAAGAAC 16920 
CATGCAAATT TGGAATAATA GTGGTTTAGA ATCTAAAACT TCAAATCCTC CTATGTTATC 16980 
CAACACTGGT CATCTTTATG TAGCTAACTG GCCATACCCT TTGATTGGAC CAAATGCTAT 17040 
TGAAAACCAA CAAACTGAAA GGAAATTTTT GTGTGATAAG TATATGTGGC AGATACCATT 17100 
TTCTAGTAAT TTTTTGAATA TGGGTAATTT AACAGATTTA GGGCAAAGTG TTTTGTACAC 17160 
TAATTCTAGT CATTCACTTA ATATGGTTTT TACTGTGGAT AGTATGCCTG AAACAACTTA 17220 
TCTAATGCTT TTATTTGGTG TTTTCGACCA AGTTGTTATT AATCAACCAA CAAGAAGTGG 17280 
AATAAGTGTA GCTTATTTGC GCCTTCCTTT TTCAGCTGGT AGTGCAGCAA CATGAGCGGC 17340 
ACATCCGAAA GTGAGCTGAA AAATCTGATT TCATCATTAC ATTTAAATAA TGGATTTTTG 17 400 
GGCATTTTTG ATTGCAGATT TCCAGGTTTT CTGCAAAAAT CTAAAATTCA AACTGCTATT 17460 
ATTAATACAG GTCCCAGAGA ACAAGGCGGA ATACACTGGA TAACATTAGC ATTAGAACCC 17520 
ATTTCTTATA AGCTATTTAT ATTTGATCCA CTCGGATGGA AAGACACTCA ATTAATTAAA 17580 
TTTTATAATT TTTCACTAAA TTCTCTTATT AAAAGGTCGG CCTTAAATAA CTCAGACAGA 17640 
TGTATTACAG TAGAAAGAAA TACTCAAAGT GTTCAATGTA CCTGTGCGGG ATCGTGCGGC 17700 
TTGTTTTGTA TATTTTTCTT ATACTGTTTT CACTTTTATA AACAAAATGT ATTTAAAAGT 17760 
TGGCTTTTTC AAAAATTAAA CGGTTCAACC CCTTCTCTGA TCCCATGTGA ACCACATCTA 17820 
TTACATGAAA ACCAGACATT TCTTTATGAT TTTTTAAATG CAAAAAGTGT TTATTTTCGA 17880 
AAAAATTATA GAACATTTAT TGAAAATACT AAGACTGGAT TAATAAAAAC ACATTAATTG 17940 
TATTCTTGCT TTTTGACGTT TTCATTAGTC TTCATCTTCA TCTTCTTCTT CACTGCTAGA 18000 
TTCCAAGATG GTTTTTTTTT TCTTTGATGG AGTAGGCTCT TCAATAGTTC CAAAAGGATT 18060 
CATATCAGAA TCCTCTTCTA TGTTAGGCAA CATAGTATTT TTAACCTGGA ATGACTGATT 18120 
CCACTTAAAT TGAGAAAACT GAAT.TGGAAT GTTATTTCCC ATACATTCAT TCCAAAATTT 18180 
ACGCACAAGA GTTAAACACT GTAACATATC TGGCAAGCTA ATTTTCATCT CACAAAATTT 18240 
TCCATTATTA CGTCTCAAGT TGTATTGATA GTTACAACAT TGAAACACAA AAACAGCAGG 18300 
GAATGTAACT GCTGCGGCCT GAACTCTATT AACATCCTGA ACATCAATTC CTTCCACTCC 18360 
AG A TAT AG AA AATGGAGTTA TTTTAGGGAG TTGTTTTCCT ATTGTTTGTT TGCCACCATA 18420 
ATTACATTCA CACTGACCCA ATATAAAAAG CATATTTCCG ACTTTAGCTT TCGGAAACAC 18480 
AGCTTTTGTA GTTTCAATGG CATTTTGCAT AGCCAGCAAG GCCTTCTTTT CATCTGAAAA 18540 
GTTAAGACCA CAACTGCGAG GAGAACATTG CCCAAAACGC TGATGGGCAT CCTCAGCACA 18600 
TAACACGTAA TGTTCCTGAA CTATTTTTAC TACTTGTTTA TTCATACGCC CATTACTAAG 18660 
AACACCCCTC CCTTCCTTTA GGGCTTGCAC CCCTGCTTCC GATGTTGGAG GCATTTCAAT 18720 
TTCATTCACC CTTTTAAACA TGAAGTCACC ATGAAAACAT CTAGGACGGT CCTCCTCCCA 18780 
ATCATGATAC CACAAATAAC AACCAGAAGC ATTAAAGTTT GGAATCAAGT CAATTTGCTT 18840 
ACAAATTGCA CTATATAGCA TTCTACCTCC TACAGTAGCC ATAGATTTAC TGCTACTATA 18900 
AGTCAAATTT ATAATTTTCA TCTTTTTCAT GTACTGAGCA AATAATTTTT CACAATCTCC 18960 
TTCTTCAGGA TGAAACTTCA TTTGACTGGT ATCAACTTTA ACACACTCTC CAAATTTAGC 19020 
TAAAATTTCG AGCGCCGCTT GAACTTTATT CTGAAATTCT TCTGTAGTAG ATTTTCTCTT 19080 
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Fig 1 (cont) 

CTTGATAGAT TTAGTAACTT TTTTAGAAGA CATTATGTTA GTTTTTTTCT CGTTGTAGGA 19140 
TGGCTGAAAA AAATATGGGA GAGTCAGAGA AGGGTTTGAA CGAAGAAGAA TTTAACTCTA 19200 
TTCTATCAAA ACATCTGGAA AGACAAATTA AAATCTGTAA AGCGTTAACA TCAAAATTAT 19260 
CGAACTGGAA TATTGGAACA TTGTTAGAAA ACTTGTTATT TTGTCCTGAT GAAAGACAAT 19320 
CATCAGGTGA TCCCGACCCA AAAC TAAACT TTTATCCGCC TTTTTTAATT CCGGAATGTC 19380 
TTGCATTGCA CTATCCATTT TTTCTAACAA CTCCTATTCC GCTATCATGC AAAGCGAACA 19440 
AAATAGGAAC TAACACTTAC CGAAAATGGA TGAACAATCA AGTCCTGGAT TTACAAATAC 19500 
CTTCCTTGGA AAATTGCAAA TGGGATGATA GCTTGGGAAA TGTAGATTTA ATTGAAGAGC 19560 
TTAAAGAGAA CCAAAAACTT GTTTTAGTAA AACAAGACCA TGAAAGAAAT ATATGGTTTA 19620 
AATCAAAATG CAAACAACTT CAAAGTTTCA GCTATCCCTC ACTCAGTCTG CCCCCAGTTT 19680 
TACAACAAGT TTTAATTGAA TCTCTTATCG GCATTAGTCA GGATCCTAAT AACTTTGACA 19740 
AAAATTACGA ACCTGCAATA ACTCTAGAAA AACTACAACA TGTAAACTGT GATCAAGATT 19800 
TAAAACAAGT TCAACAAAAA GTATCTTCAG CCGCTACATA CGGAATACTT TTGAAATGCA 19860 
TTCAGACTTT ATTCAGTGAC AAATTATTCA TTCAAAACTG CCAGGAATCA TTACATTACA 19920 
CCTTTAACCA TGGTTATGTA AAATTACTTC AATTTTTGAC AAATGTCAGT TTAAGCGAAT 19980 
TTGTAACTTT CCATGGTTTA ACACACAGGA ACAGACTCAA TAATCCGCAG CAACATACAC 20040 
AATTGGCAAC CGAAGACAAA ATAGACTATA TCATAGATAC AGTGTATTTA TTTTTGGTAT 20100 
TTACGTGGCA GACAGCAATG GATATTTGGA ATCAAACATT AGATGATAAA ACAATAAATA 20160 
TAATTAAAGA GGAATTAAAC CAAAATTTTG AGAAAATTGT CAAAGCTGAA TCAGTTGATG 20220 
AAGTTTCTGA AATTTTAAAG TCTATTATTT TCCCTGAACT CATGCTGCGA GCTTTTTGTT 20280 
CTAATTTACC TGATTTTATA AATCAGAGTC AG AT AT C AAA TTTTAGAAAC TTTATCTGCA 20340 
TTAAATCCGG CATACCGCAG TCAATTTGCC CCCTATTACC TTCAGATCTA ATTCCTTTAA 20400 
CTTTCCTAGA AAGTCATCCA ATACTCTGGA GTCATGTAAT GTTACTAAAT CTTGCTTCAT 20460 
TTCTAGTAAA CCAAGGCAAT TATTTGCATG AACCCGAAAA ACCTTTAAAT ATTTCATCAG 20520 
TTTACTGTAA TTGTAATTTA TGCTCTCCGC AAAGAATGCC ATGTTACAAT AGCAGTTTGA 20580 
TGCAAGAAAT ACTAACCATT GATAAATTCG AGTTCACAAA CTCTGATAAA ACAAAACAGC 20640 
TAAAACTGAC CCTCCAAACT TTTGCTAATG CCTATCTTAA CAAATTTAAC TCAGCAGAAT 20700 
TCTACCATGA CCAAGTTTTA TTCTACAAAA ACTGTAAAAG TAAATTTTCT AACCAATTAA 20760 
CAGCTTGTGT AATAAAAGAC GAAAAATTAT TGGCTAAAAT AGCAGAAATT CAAATAACGC 20820 
GGGAAAAAGA ACTCTTAAAA AGAGGAAAAG GAATTTATTT GGATCCAGAA AC AG GAG AAA 20880 
TCTTAAACAA TGGAGAAGCC ATATCATCCT CTGAAAACTT CCAAAGGCAA AGAACTAGCT 20940 
ATGCTCTACC ATCAAATGAA GGAGAGCGAG CTGGATGGGA AGCCGATGAG CGAAGAAGAC 21000 
GAAGGAGAAG TGAGTGAGGA TGAAACAGAG ACAACAATTC CAAAGAAAAT GAAGTTTACA 21060 
AGTAAGTAAG CTCTAAATTT TTTATATTAA AAACTGAATT TTTTTAGACA AAATTATTTT 21120 
AAATTAAATC TTTATAGCTA GCAGTTGATC TTTGTTCGTT TTTCAGAAAA CTCAAGTGTT 21180 
CAGTCATATC AAGTTCACTT GCCTCTGAAA CACGAAATTG CGGAAATTCT AGAAAAAATT 21240 
AGACTAGAAT CTAAAAAATA TCCAGGAAAA GTTTATCAAA TAAGAAATAG AACTCCAGCA 21300 
AGTATTACAA AACGATACCT GTATGAAAGA GATCTGAAGA AACTGTTCCA GTATCTAGAA 21360 
GACGCAAAGA AGCTTTACGC TAAGTACCAA AGCTGAGGCT TTATAGTTTT AAATTTTCCC 21420 
GCCATGGCTC AACCAGTGAC GCCTTACGTC TGGAAATACC AACCAGAAAC AGGATATACT 21480 
GCTGGAGCCC ATCAAAATTA TAACACTGTT ATCAACTGGT TGCATGCCAA TCCACAAATG 21540 
TTTGCCAGAA TTCAACATAT AAACACCGCA CGCAATGTTA TGGACAAATT CCGCTCTGAT 21600 
TTGACCCGAG ATGACATCGC GGTTAACATC AACAACTGGC CTGCAGAGGA TTTAATGCAA 21660 
CCTCCTAATT TTCCTTACAT TCCTGCGACC TCTAAATCCG CTTCAACCAT AAATGACTGG 21720 
TTGGCTACCA CTCAAGGAAT TCAACTCAGT GGAACTAGTG AACTAAACGG GTGGGGATCT 21780 
AACCGCCTGA CTTCCTATCC GGATATTCCA CCCATTTTAA AGTATGAAAG GCCTGGTCAA 21840 
CAACTTCAAG GCCAAGGACT TTTTAAGCAA GAAAATATTC ATTTATTTTA CGAATCTCCG 21900 
CGCCTCCCTC GCTCTGGAGG ATTAACTCCC CAACAATTTG TAAAAGAATT TCCGCCTGTT 21960 
GTTTATAATA ACCCCTTCTC AGAATCTATG AGTGTATTTC CGAAAGAATT TAGTCCTTTG 22020 
TTTAACCCTT CAGAATCTTT GAAAAAAACA TCCAGTCAAA CTTTACAATA TAAATAAAAA 22080 
ACTTCTATTG ATCTTTATAC TTACACTAAA GCATCGCGTT TATTTTCGTC GCCATAAAAA 22140 
TATATCAAAG ACCCGTAATT CTCTAACTTT AAATCATTTT TTGAACTAAT CTTAATCCAT 22200 
TTAAATGTAG GAATTAATAT ATCAGAAACC AGTAACAAGC CAGAATTAAA ATATACTTGT 22260 
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Fig 1 (cont) 



GTCATTTTTA 
ACTGGTTCCT 
CTTATCTTTA 
ATTGGGAGAT 
CAAAGTTCCC 
AACTGAGTCA 
GGGGAGTTTA 
AAATGTTTCA 
CCTAGGATTG 
AGAAGGAAAT 
CTTAAACTAT 
TCCATTAACT 
TTCAAATAAT 
TTTAACTGTA 
CTCAGCCCCA 
TGTAAGCGAT 
AAAACTTAGT 
TGTTAAAAAA 
CATAAAATTA 
CGATAACGCT 
CTTAAACCTT 
CTCTCTAGGG 
CGTTAACTTT 
AGCTTTCATT 
AGGTTTTCAA 
TGTAACTCAA 
ATCTACCATA 
TACAGCACAT 
TTACATACCG 
GATCATTGGA 
GAAAACCCCT 
AATCTTGCAC 
GGTTATAATC 
TTGTTATAAG 
TCCAAGGTAA 
GTTTTAAACT 
GCTGACAAGG 
AACTTTCCAA 
CTTCGCAACA 
AATTAAAGAA 
ACATTGCTGC 
AAATGTAATT 
TTGTGCATTT 
ACAGGAACAG 
GACAATAAAA 
TGATAATAAA 
ATAGGAAAAT 
TACACACGAA 
TGACTAGGAA 
AATCAGGCTT 
ACGTTTCATC 
CATTTAAAAC 
CAAAATTTGC 



CAGATGAAGC 
CTGCCTCCTT 
AACTTTACTG 
GGAATATTCA 
CTGACTGTCT 
GACTCTTTAG 
GTATTGAACT 
AATCCACTAA 
CAAAATGAAA 
TTAGGTATTA 
AAGAATCCTC 
GTTAATACAA 
GCTTTATCAT 
AATTTAACTA 
CTAGTGTCAT 
AATTCTTTAA 
GTAAACTATT 
CCTGTAATGA 
AATGATGCAG 
CTAAAACTGA 
GGAAACGGTT 
TTTAATGCTT 
CTAAGCACAA 
TATATTTTAT 
CCCCCACAAG 
ATTGTGGGAA 
ACTATGAAAT 
CAATTCAGAC 
TTCTTGACAT 
ATCTATAGAA 
TAAATCTACC 
TTCTGGACTT 
TGTTACAATT 
TACTGTAAAA 
AAAAACAGGC 
TTGACATTGC 
TAAGTCACAC 
GACTTTAAAA 
CATAATGGAG 
CAACAATACA 
AAAGTATCTG 
CGTTTAACAG 
GTAAGCTCCC 
TCTTAACGTT 
CACTTTTGGC 
ACTTATAAGC 
AACAAAAAAA 
TAGCAGAACC 
CAGATGGTTT 
AATAGGAAAA 
CTGTACATTA 
TCCCACCAAA 
CCATTTTAAA 



GAGCACGCTG 
TTATTGAAGC 
ATCCTATCAC 
TAAACGGAGA 
CAGATGAAAC 
CTTTAAAACA 
TAAATACTCC 
AGATAGCGGC 
GTTTGGGCTT 
AATTGAAAAA 
TCGCCATTAG 
GCGGATCTCT 
TATTTATAGG 
GGCCTCTGGT 
TGCAAGACAA 
GATTGTCTCT 
CTAATCCTTT 
TTAACAACAC 
AACAGTTGAC 
AACTTGGAAA 
TGACTTTCCA 
CTGGGGAATT 
CTCCTATAGC 
CAGGAACACA 
ACTTTTTGGA 
ATGATGTTAA 
TTACTTCTCC 
AATGAATATT 
AATGTGCCTC 
GCATAACTCT 
ATATTCATCT 
TTAAAAACAA 
TTACTTATTT 
TCATCAAATG 
ACACGAACAT 
AAAGAATTTG 
AAATACAACT 
CTAACAAACG 
TTCATGCTAC 
ACATACGAAG 
AACATTTACA 
TTTGATATGA 
AGAAACATTA 
TCGTTCAGAA 
AGCTAAACAT 
CATATCGGCC 
CTGATTATAT 
AAGACGACCA 
CTCACAAGCA 
GAAGAAAAAT 
CTAGTCACAA 
TTGTCCCAGT 
TAATCCAAAG 



GGACCCGGTT 
CGGAAAAGGG 
TATAAATCAA 
GGGTGGCCTA 
ATTGCAACTG 
ACCGCAACTT 
TTTAAATCTA 
AGATTCTTTA 
AAATCTAAGT 
TCCTATGAAA 
TAATGATGCG 
AGGAATATCT 
AAAACCTTTA 
ATGTCGTCAG 
TCTTACTTTA 
AAATTCTCCA 
AGTTGTGACT 
AGGTAATGTT 
TTTAGAAACC 
AGGCTTAACT 
ACAAGGTCTT 
ATCAACAGCT 
TTTTGGGTGG 
ATTTACTCCT 
TTTCTTCGTT 
GGTTATTGGC 
CTTAGCTGAA 
TTAAAAATTC 
TATAATTAAC 
TCCAATAAGC 
TTAAGTGTAC 
ACAGTACTTT 
CTTCTTCCAA 
AATAACATAA 
GAACTTTTTT 
GCTGCAAGCA 
TTAACAGCCT 
GTATATCACA 
ATCCAAAGAT 
AAAATTAAAA 
TTTTATACTT 
ATACCATTTT 
ACGGACAGGC 
AACAAAGTAA 
TGCAAAGATC 
CTCTTGCAAA 
ATGAATGGAG 
CGCCCAACAC 
ACAACTTTGA 
AATTTTCCCA 
ATACAACCTC 
CTACCTCAAA 
CATCAAATTC 



TATCCCTTTT 
CTAAAAAGCG 
ACCGGTTTCT 
TCAAGCACTG 
CTATTAAGTA 
CCCCTAAAAA 
CAAAATGAGA 
ACTATAAACT 
GATCCTATGA 
GTTGAAGAAA 
TTAAGTATAA 
TATTCTACTC 
GGATTAGGAA 
AACACTTTGG 
AGTTATGCTC 
CTAAACACAA 
GACTCTAATC 
GACTTAAGCT 
ACTGAGCCCT 
GTTAGTAATA 
TTACAAATTA 
ACAAAGCAGG 
CAAATAATAC 
CAATCCCCAG 
TTAAGTCCGT 
CTAACTATTT 
AATGTACCAG 
TTTATTAAAG 
AAATCTAAGC 
ATAATCATAT 
AGTATCTAAC 
CATAGGACAA 
TGGCAAAGCA 
CACATTTGTA 
TAAAATTAAC 
ATGACAATGA 
AAATATAACA 
ATAAAAAAGA 
GGTTCCGACA 
CGTTTTTCAA 
ATAAGCTCAC 
GAAGAAAAAT 
AAATCCAAGT 
CAGGCATATG 
CAGGTGAATT 
ACGAATCAGC 
TTAATATCTT 
AGGTAAATAT 
TTTGCTTATC 
ATAATAACGA 
CGCTATCAAA 
AAAGCCAGTT 
AGGAAACAAA 



CTGAAGAGAG 
AAGGGTTGAT 
TAACTGTAAA 
CTCCAAAAGT 
ATTCTCTAAC 
TAAATGATGA 
GATTGAGTTT 
TAAAGGAACC 
ATATAACTCC 
GTTCTTTAGC 
ACATTGCGAA 
CCTTACGAAT 
CTGACGGCTC 
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AATTGATTTT 
ACATTAATGT 
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AACCTCTGTA 
AACGAGATAT 
AAGTTTCAGA 
AGAAATAGTT 
ATTACAACAA 
ATTAAAGCAA 
ACAATGACAA 
TTTTTGGCTT 
CTTCAAATTA 
TTCAAGTCCA 
CATCACTGCC 
AAGAAATTCC 
GATTCCCTAT 
CCCATATTTT 
TCTTTCTGAG 
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Fig 1 (cont) 

CTAAAACATA TACAGTTTTA TCGCCATTAA ATCTAAAAGC CATCCTAAAT GGACCTCTAG 25500 
CCCAGTAGTT TAAGTACCGG GAAGAGACTA TACAATATAC TTGATATTGA TGTCTGTTAA 25560 
GTGGTGATAA AAAAGAAAGT AATTCAGAAT TAGGATAAAG CATTCTCCCA TGTTGATTCA 25620 
TCTACAAAAA ACAAAAAAAT. TATAAGGTTC ATAGAAAACC TACTATTTAA CAAATCTATA 25680 
AAAATGCATT AAAAAGTTAC CTTGAATATA AATTCAGATC ACCTAAAAAA CGAAAAAAAA 25740 
TAACATTTAT GTTAGTAAAT GATAGTCTTT AAAAATTAGA AAAGAATCAA GTCGCTTTTA 25800 
TACTTACAAA CTCCAAATAA ATTCTGTAAC CAAGAGAAAA ATTGTAACCT AAAAGGTAAA 25860 
GAAGAACATT ATAAGATTAA AACCACTCTA AAATCTGAAA AGCATTATGA AAAATTCTGA 25920 
TAGCTGCAAC TTACTAGTCT TCTCCAAATG TTGCAGGCAT TTCAAAAAAT CAAGAGGAAA 25980 
ACCGGAGTTT ATAAAGTAGT AGTCTGATTA TATCTGAAAA AGTTTAACTT CCTTTTCAAC 26040 
CCAACCCAGT CCAATAAAAT TCCAACCTTA ACTTCTTTCC TGCTAAAACT CCATAAAAGT 26100 
CCAATTACCA CTTGACTTTT ATTTAACCTC AATTATGTTA CATGTTATTC TACCCATAAA 26160 
AACTTGATGA CCAAGAACTG ACCTTTCCCA TGTTTTTCTG AAATAACAAA AATGTTGATT 26220 
TAAAGATTTT TAACTACCCA AAAAACCCGC TCTCATGATT TTTTCTTATA TAAACAGGAT 26280 
ACAAAAGAAC TGGCAAAGAT ATTCCATCAT ACTTCTCCAA CTGTCAAAAC ATACCACTTA 26340 
ACCTCTCCCA TGTTTTTTCC CTTTTGCACA AACAGGATAT AAAAAATATT TTTGCCACAA 26400 
TGTTTTTCCT TTTACTCAAC TGCCAGAATA AAAATGAACA GCTTAACCTT TTTCCCTCTT 26 460 
AACCCATTGC GTTCCTCTAA GAAAAAAATT ATCCCGCCCA ATATGCTAAA GGCTTCTCCC 26520 
GCCAAAACAG CTCAACTTAA AATCTCTCAT GAATAAAACC CAGAGAAAAT TTCCAGTAAT 26580 
AAAAATTAAT AACCGTGAAG TACTAGATCT AATAATGATA TTTTGAACTC ATAAAAATCC 26640 
ACCATCCATG TAATGTTACA AACACTTTTT TATTGAGTTT TTTCTTACAA CTGCATTACA 26700 
TACAGGCCAA GCATCAAACT TTCTTCTGTA TTTCTTCCTA GACCACAAAA TTACAGACTT 26760 
ATATTTCTGC CACAAATCTC TATGATCTTT ACAGTAACAC TTACATTTAA ATGGGGAATA 26820 
CAGCAGCAAA TAAGGATGAG TTAAACATGC GATACAATGA CCAGAAGGAA GATAATACAA 26880 
TACATCACAC CAAAATGAAG GTACAGACAA CATCGCATGA AATCTTAAAT GTGATTTTAC 26940 
AATAAATTTC TGCAGCAGCT TACAATCTAT ATTAGCAAAC CGTTTTATAT ACAAACATAA 27000 
AAACTTGGAA CTTTTCACCA ACTCAATCAT GTTATTATAA CACATTACAA ATTTTGCTAT 27060 
ATCTTTATTT GTCAAATAAC AAAATATCTC AATCCACAGC TCATCTGGCA GCAAACTTCG 27120 
CAAATCCATG ACCTGTAAAA GATACAACAG AAAACAGAAA ATTAATGCCA TTCAATAACA 27180 
TAAAAAATAC AGTCAAATCA CATACTTTTT CTCACTTACA AAACTTTGTG AGCAGGCCTC 27240 
CAAAACAAAC TTCAGAAAAT GGATGCATAC AAGAACATTC TCCTCTCAAA AATTGCTTTA 27300 
ACTGAATGCG GCATTTTGCA CCTCCAGAAA AATGCAGTCC ATTGAGAGGC TCTTCTCTTA 27360 
AAACACAGAA ATGCTTCTGC AAAATCTGTA AAGAAACTAA CAACTTCCAA ATTCCAATCA 27420 
TCATGCATTG CAAAGAAGGA CATTCAACAG CAAAAGGATC GTGATGAGCC AATAAAGCTT 27480 
TACTGTATGA CTCATTTTCA TGAATTACAG TCTGTAACTT ACTATAATGC ATTTTAAGCT 27540 
CTGCTTCACA AATTAATAAT GCTAATTTCT TTAAGCAGCT CAAAGAAAAC TCATCAGGAC 27600 
AACGGCATTT AAGAAAGCAA CAAAATGATT TCTTAAAATA CATTTTTCCA GCATGATGAA 27660 
CAATAAAAAA TTTCAACGTT AAACAATGCA AAAATGCATT TTTATGCACA GTGAAAGTAA 27 720 
TTTTTTCAGC TGAAGCTAAA TCACAGCCTA TTTTATTACA TGATTTTGTA TGCTCCAAAA 27 780 
GAGCTTGTTT TAATTGCTTC AAATCCATCT TCTTACAATT TTTTCTTTTT ATAAACACCA 27840 
GAACCGCATT CAGGCCAATT CCAGTTATTG TTTAAATTTG CTACAGAAAC TGCAGACCAC 27900 
AAAACCACAT CCTCTAAATC AACCCACAAA GATCTATGAT CCACACAAAA ACACAAAGAA 279 60 
TGATACGGAG AATACAACAA TAAATGGGGA TTAACAAGGG ACGCAACACA ATGACCCGAA 28020 
GGTAATAAAG TTTTACAGCA CCAATTACAA GCAACAGGTA ATGGAGTATA TTTCCCAATG 28080 
CGACGAGAAA GCCGAATGTC ATTCAGAACA GCATTGCATT TTATCTTCTC AAACCTCTTA 28140 
AGGTGCAATT GTATAAAATA AGAATCCTTA ATGACAGTGA TGAATTGAGG AAAAGCAAAA 28200 
ACAAAACTAG CAATGTCTTT GCTTGTAAGT TTCAAAAATA TCTTCATCCA AATCTCAGTC 28260 
GGTAATTCAA CAAAAAATTC AGGGGCCTAC AAAATTAATC AGACTAATTT AATATCATCT 28320 
TGTAAACAGC GAAAAGAAAA AATAACACAC CCAAAAATAA AAAACTCTTA CCCCTGTTAT 28380 
CCATCGAGAT AC AC AG AAAA ATTCAGAACA CTCAGTGTCA TGTTTCTTAA ATTGTTCCCA 28 4 40 
AAGCTCAGAC ATTCTAAGCC AAAAATTTTT TGAGAACTGC AAAAACCCAG TTTTTATAAC 28500 
AAAGCCTTAA TGTTTTCTTA ACTGATTTAA CTGCCCTAAC AGGAACTCCA CATTCCGGCC 28560 
ACCGCCACCC AGGGGACAAA TCTTGCCAAG AACTACAAGT CCATAAAACA ACATCCTGCA 28620 
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Fig 1 (cont) 



AATTATACCA AAGGTTTCTA TGGTCGACAC AATTACAACC TGACCTAAAA GGTGAATAAA 28680 
GCAGTAAATA AGGATGAGTT AAACAGGCCA CACAATGTCC AGAATGTAAA AAATGCTTTG 28760 
TTTGGCACCA ACCAGACCAC AGCTGAAGCA AAGGAAAATT GTAGCGAACA CATTCTTCTC 28800 
GTAATCTGTT TAACACAGAA CAACATTCAA TTCTGGCAAA CCTCTTTAAA AAATGTTTTC 28860 
TGAAATATTT CTTTAAAATG ACAGTTTGCA ACTCTGGAAA ACACAAAATA AAAGCCGCAA 28920 
TATCTCTACT GCTTAAATAT AAAAATATCA TTGTCCAAAT TTCTACTGGT AAAACTGAAA 28980 
GCATCTTCTT CCTATTAAAA AAAGAAAAGT GTTTTCAAAT TATATTAGAC TCTAACCAAA 29040 
AAAATTCAAA TACTTTTCCT TTATAATGTA CATTAAGAAT AAAAATATAC TCACCGTTTA 29100 
AAAGTAGAAC TTAACAGTAT AATATAAATA CAAGTGAGCT GAACAACGAC AGCCGATTTC 29160 
AGCCGGAGCA AAATTAAAAA GAATAAAAGG ATCAAACCAA CACGTAGGAC AGTCTACTCC 29220 
AAAAC AG TAA CGGCAGTATG ACACAGAAGG AGAGGAACTA AGTCCAGGAA ACTTCGCCCG 29280 
GTGCGATAAA AAGTAACGCC GCCGGAAAGC AGTTGAATAC AAAAGAGGTA AAAATTCACG 293 AO 
AAAAACAGAA GCAAAAACTA CTAAATCTGC TATTGGCAAA TAAAGAAAAA TTTCAAACCA 29400 
TATTTCCAAA G G AAG AAAAG CAATCATACC GTAGAAGAAC CTGAAGGCGA CCGCAAACGT 29460 
GCTCCCGTAC CACAACGTCA CACGCCACAC CCACTGGGAA AACCCACACG CCCCGCCTCT 29520 
GTGCAACGTT ATATATATGA AT AG 29544 



WO 96/03508 



PCT/AU95/00453 



11/23 



<D 




t 

O 
O 



4i 

£ > 



♦ l A 4 

X Q. 4 — 

Q- T> 



1= 



in 
in 



CD 
< 

a =4 



^ a. 

•S CD 



a. 

Q. 



in 



03 
> 



CM 
Q. 



3 
ft 



It 



CM 



# 



WO 96/03508 PCT/AU95/00453 



12/23 




WO 96/03508 



PCT/AU95/00453 



13/23 



o 



o 




^ CM P) 



-O 

JSC 
CD 

CO 
CNJ 



CNJ 

CO 
CNJ 



00 

CVJ 
CVJ 



CVJ 
CNJ 



o 

CNJ 
CNJ 



CD 
CNJ 

CNJ 
CNJ 



O 

CNJ 



a. 

c 
CO 



> 



CP 



o 
in 
c 

CO 



WO 96/03508 



PCI7AU95/00453 



14/23 



CC 





Fig. 6 



WO 96/03508 



PCT/AU95/00453 



16/23 





Fig- 7 



WO 96/03508 



• 

PCT/AU95/00453 



17/23 



wtOAV287 OAV100 OAV200 wtOAV287 

- + - + SpHI UAEN UAEN 




12345 123456789 10 

(A) (B) 



Fig. 8 



WO 96/03508 



PCI7AU95/00453 



18/23 



Tandem 



Notl P s!gnal BgiM Clal Xhol Hindlll 

3' CGCCGGCGAAATAAAAATAATTCTAGACTATAGCTAGAGCTCTTCGAA £ 




Fig. 9 



WO 96/03508 



PCT/AU95/00453 



20/23 




FIGURE 1 1 A 



SUBSTITUTE SHEET (Rule 26) 



WO 96/03508 



* — m 



PCT/AU9S/00453 



21/23 



36hrp.i. 48hrp.i. 60hrp.i. 
CSL503 J BOVINE TURBINATE | 



u luiu 




Infection with OAV204 (MLP/VP7sc) 
FIGURE 11B 



SUBSTITUTE SHEET (Rule 26) 



WO 96/03508 



PCT/AU95/00453 



22/23 



48hr pi 

OVINE 
CSL503 

I U M 




OAV206 (HCMV/VP7SC) 



FIGURE 12A 



SUBSTITUTE SHEET (Rule 



WO 96/03508 



PCT/AU95/00453 



23/23 



48hr pi 



RABBIT KIDNEY) BOV TURBS 



U 



u 



M 





OAY206 (HCMV/YP7SC) 



FIGURE 12B 



SUBSTITUTE SHEET (Rule 



INTERNATIONAL SEARCH REPORT 



International Application No. 
PCT/AU 9S/004S3 



CLASSIFICATION OF SUBJECT MATTER 



Int Cl &: C12N 15/34, 7/01, 15/86; A6IK 48/00, 39/235 



According to International Patent Classification (IPC) or to both national classification and IPC 



FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
IPC 6 : C12N 15/34, 7/01, 15/86 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 
AU: IPC as above 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
DERWENT: ovine, sheep, ewe, adenovirus, adeno (w) virus, vector, trichostrongylus or colutoriformis or taenia or 

ovis or lucilla or cuprina 
CHEMICAL ABSTRACTS: STN sequence search 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



P,X 



Citation of document, with indication, where appropriate, of the relevant passages 



VETERINARY MICROBIOLOGY, Volume 41, No. 3 (1994), BOYLE, D B et al. 
"Characterization of Australian ovine adenovirus isolates", pages 281-291 
whole document 

NICHOLSON, B H "Synthetic Vaccines" published 1994 by Blackwell Scientific 
Publications (Oxford, UK) pages 346-361 
whole document 

JOURNAL OF APPLIED PHYSIOLOGY, Volume 76. No. 6 (1994) 
LEMARCHAND, P, et al. "In vivo adenovirus-mediated gene transfer to lungs via 
pulmonary artery", pages 2840-2845 
whole document 



(~j Further document* ire lined in the continuation of Box C 



Relevant to claim No. 



1-23 



1-23 



1-23 



□ 



See patent family 



"L" 



Special categories of cited documents: 

document defining the general state of the art which is 
not considered to be of particular relevance 
earlier document but published on or after the 
international filing date 

document which may throw doubts on priority claim(s) 
or which is cited to establish the publication date of 
another citation or other special reason (as specified) 
document referring to an oral disclosure, use, 
exhibition or other means 

document published prior to the international filing 
date but later than the priority date claimed 



"Y" 



later document published after the international filing date or 
priority date and not in conflict with the application but cited to 
understand the principle or theory underlying the invention 
document of particular relevance; the claimed invention cannot 
be considered novel or cannot be considered to involve an 
inventive step when the document is taken alone 
document of particular relevance; the claimed invention cannot 
be considered to involve an inventive step when the document is 
combined with one or more other such documents, such 
combination being obvious to a person skilled in the art 
document member of the same patent family 



Date of the actual completion of the international search 



26 October 1995 



Name and mailing address of the ISA/A U 

AUSTRALIAN INDUSTRIAL PROPERTY ORGANISATION 

PO BOX 200 

WODEN ACT 2606 

AUSTRALIA Facsimile No.: (06) 285 3929 



Date of mailing of the international search report 



Authorized officer 



JENNIFER POTTER- 

Telephone No.: (06) 283 2247 




Form PCT/ISA/210 (second sheet) (July 1992) copbko 



